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INTRODUCTION 


Our  research  for  this  period  is  sununarized  in  five  parts  below. 
In  part  A  we  extend  our  results  on  adaptive  routing  to  further  connect 
the  two  levels  —  global  and  local.  In  part  A.l  we  present  an  efficient 
global  routing  algorithm  which  minimizes  the  maximum  utilization  in  a 
link.  This  assures  stability  and  generates  required  flows  for  the  local 
level.  It's  simple  objective  function  results  in  a  very  fast  implementa¬ 
tion.  In  part  A. 2  we  reconsider  the  local  level  optimization  but  with 
output  flow  constraints.  We  show  that  there  is  a  reasonable  amount  of 
control  embedded  in  our  priority  rule.  We  also  pose  the  problem  as  a 
Markov  Decision  Process,  identify  when  deterministic  policies  are  opti¬ 
mal,  and  show  for  a  simple  example  that  a  threshhold  policy  is  optimal. 

In  part  B  we  present  several  results  which  generalize  the  analyses 
of  random  access  in  ALOHA,  CSMA,  and  CSMA/CD  to  include  arbitrary 
packet  length  distributions.  In  part  C  we  extend  our  work  on  Multihop 
Packet  Radio  Networks  to  include  non-exponential  packet  length  distri¬ 
butions  and  present  more  examples,  especially  the  throughput  of  a  "hot 
spot. " 

In  parts  D  and  E  we  summarize  some  results,  on  algorithms  for 
Centralized  Teleprocessing  Network  Design  and  on  augmenting  paths  for 
Combinatorial  Optimization  Problems. 


RESEARCH  SUMMARIES 


A.  Studies  in  Adaptive  Routing 
A.l.  Stable  Routing  Patterns 

As  part  of  a  general  investigation  in  the  area  of  dynamic  routing 
in  computer  communication  networks  we  consider  the  problem  of  finding 
stable  global  routing  patterns.  Specifically,  we  have  proposed  [1]  a 
two-level  routing  procedure  where  the  lower  (local)  level  adapts  dy¬ 
namically  to  instantaneous  variations  in  the  congestion  of  the  network  in 
the  immediate  vicinity  of  each  node  and  the  higher  (global)  level  en¬ 
sures  stability  by  keeping  the  average  load  across  the  entire  network  in 
some  sense  globally  balanced.  We  now  consider  this  latter  problem. 

We  consider  as  a  measure  of  global  balance  the  utilization  of  the 
most  heavily  utilized  network  element  (node  or  link)  and  seek  to  mini¬ 
mize  this  quantity.  For  simplicity,  we  will  speak  only  of  link  utiliza¬ 
tions.  (Node  utilizations  can  be  included  in  a  straightforward  manner.) 
Thus,  we  are  given  a  network  containing  N  nodes  and  L  (directed) 
links.  Each  link,  1,  has  a  capacity  Cj.  There  are  (directed)  require¬ 
ments  r-  between  nodes  i  and  j.  Each  r-  is  satisfied  by  routing  it  on 
one  or  more  paths  from  i  to  j.  (In  the  two  level  adaptive  routing 

scheme,  these  paths  (or  links  within  them)  will  be  the  alternatives  open 
to  each  requirement.)  A  routing  pattern  is  defined  by  the  paths 
and  the  fraction,  f^^.^  of  r..  using  each  The  utilization  of  each 

link  is  equal  to  the  total  flow  (sum  of  fractions  of  requirements)  on  the 
link  divided  by  its  capacity. 

The  maximally  utilized  link  is  in  a  sense  the  most  vulnerable  part 
of  the  network  and  the  most  likely  cause  for  the  dynamic  routing  mech¬ 
anism  to  break  down  (e.g.  loop)  due  to  congestion.  By  minimizing  the 


2 


utilization  of  the  maximally  utilized  link  we  seek  to  minimize  the  chance 
of  congestion  leading  to  such  failure.  It  should  be  noted  that  we  are 
dealing  with  the  global  level  of  the  routing  procedure  here  and  as  such 
consider  only  long  term  average  utilizations,  not  instantaneous  meas- 
ures.  The  local  level  of  the  routing  procedure  concerns  itself  with 
making  decisions  instantaneously  on  the  basis  of  the  local  state  of  the 
network  in  the  vicinity  of  a  node.  Even  within  the  constraints  of  a 
given  P^^^and  defined  by  the  global  strategy,  the  local  level  has 

considerable  flexibility  in  choosing  when  to  use  each  route  and  as  such 
can  obtain  substantial  reductions  in  delay  when  compared  with  static 
routing  policies. 

We  now  turn  to  the  problem  of  actually  finding  the  optimal  global 
routing  pattern  as  defined  above.  The  technique  resembles  the  Flow 
Deviation  Method  of  Cantor  and  Gerla  [2]  and  will  be  described  in 
similar  terms.  Cantor  and  Gerla  sought  to  minimize  the  average  delay 
whereas  we  seek  to  minimize  the  maximum  utilization  of  a  link.  Both 
functions  are  convex  functions  over  a  convex  region  and  as  such,  the 
same  type  of  procedure  can  be  proven  to  yield  an  optimal  routing 
pattern.  The  proof  is  given  in  [2]. 

Our  function  is  only  piecewise  differentiable  and  as  such,  the 
gradient  search  used  in  [2]  is  not  appropriate.  In  fact,  the  alternative 
described  here  takes  the  special  nature  of  the  objective  function,  a 
minimum  of  linear  functions,  into  account  and  not  only  overcomes  its 
non-differentiability  but  also  is  considerably  more  efficient  and  easier 
to  implement  than  a  gradient  search. 

We  now  give  an  outline  of  the  optimization  procedure.  A  high 
level  flowchart  of  this  procedure  is  given  in  Figure  1.  As  mentioned 
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above,  at  this  level  the  procedure  is  almost  identical  to  the  Flow  De¬ 
viation  Algorithm.  The  key  difference,  which  is  only  evident  in  a  more 
detailed  description,  is  how  the  optimal  superposition  of  flows  is  found. 

We  define  the  length  of  a  link  to  be  its  utilization.  Initially,  we 
set  all  link  lengths  to  0.  We  define  the  length  of  a  path  to  be  the 
length  of  the  longest  link  in  the  path  plus  a  small  constant  times  the 
number  of  links  in  the  path.  This  latter  term  is  added  to  break  ties 
among  paths  with  equally  utilized  links  in  favor  of  a  path  with  the 
smallest  number  of  links.  Note  that  this  definition  of  path  length  is 
different  from  the  conventional  one  but  it  serves  our  purpose.  Short¬ 
est  paths  using  this  metric  are  computable  using  conventional  shortest 
path  algorithms . 

The  routing  pattern  found  at  each  stage  in  the  optimization  proce¬ 
dure  is  a  single  shortest  path  for  each  requirement  r...  (In  general, 
this  path  is  not  unique,  but  this  poses  no  problem.)  A  flow  pattern  is 
defined  as  the  total  flow  in  each  link  and  is  found  by  loading  the 
requirements  onto  the  links  specified  in  the  current  flow  patterns. 

An  optimal  superposition  of  the  current  flow  pattern  with  all  pre¬ 
vious  flow  patterns  is  then  found.  This  is  the  key  step  in  the  proce¬ 
dure  and  is  done  by  finding  the  value  of  A.  between  0  and  1  which 
minimizes  V,  the  maximum  link  utilization,  where  k  represents  the  frac¬ 
tion  of  all  previous  flow  patterns  used.  The  new  optimal  superposition 
of  flows  is  then  \  times  the  previous  superposition  plus  (l-\)  times  the 
current  flow  pattern.  A  new  superposition,  and  hence  link  utilizations, 
is  then  obtained.  This  in  turn  yields  a  new  value  for  V(K)  and  the 
link  lengths.  We  can  now  start  another  iteration.  If,  however,  no 
improvement  in  V(K)  has  been  observed,  the  iteration  has  converged 
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and  we  terminate  the  procedure  with  an  optimal  flow  pattern.  By 
saving  the  routing  patterns  and  the  values  of  \  for  each  K,  the 

optimal  routing  pattern  can  be  obtained.  In  particular,  if  is  the 
(i,j)  path  first  used  in  the  routing  pattern  in  iteration  K  then  the 
fraction  of  commodity  (i,j)  using  P^^j^is 

n 

j=K+l 

where  M  is  the  number  of  iterations  and  the  product  is  defined  equal  to 
1  for  K  =  M. 

We  now  turn  to  the  problem  of  how  to  find  the  optimal  super¬ 
position  of  flows.  Consider  two  flow  patterns,  and  Each 

flow  pattern  assigns  a  flow  to  each  link.  Thus  f^^.^  and  f^?.^  are  the 
flows  assigned  to  link  (i,j)  in  F^^^  and  F^^\  respectively.  For  any  \ 
between  0  and  1,  the  flow  assigned  to  link  (i,j)  by  superposing  \F^^^ 
and  (l-A)F^^^  is  then 

(l-A)f^?.^ 

which  equals 

which  is  a  linear  function  of  A.  Dividing  by  C^,  to  obtain  utilizations, 
there  will  be,  in  general,  a  different  function,  aj  +  bjA  for  each  link  1. 
(We  will  for  simplicity  refer  to  links  by  a  single  index,  1,  rather  than 
endpoints  (i,j).) 

We  seek  the  value  of  A  which  minimizes  the  maximum  of  these 
functions  over  all  1.  Several  simple  observations  allow  us  to  find  this 
value  of  A  in  an  efficient  and  straightforward  manner.  First,  if  for  two 
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links,  1  and  tn,  a,  >  and  b,  >  then  link  1  is  said  to  dominate  link 
m  and  link  m  can  be  ignored  as  it  clearly  does  not  participate  in  the 
maximvun  since  a,  +  b,\  >  a„  +  b„A  for  all  values  of  Indeed,  if  a,  + 
bfX  i  ^  between  0  and  1  then  link  m  may  be  ignored. 

Note  that  this  latter  condition  is  not  equivalent  to  the  former,  for 
example  if  a^  =  10,  bj  =  0,  ~  ^m  ” 

We  can  then  arrange  the  links  1  in  descending  order  of  a^  and 

examine  the  bi.  Any  link  m  for  which  b  does  not  exceed  b, ,  where  1 
is  the  predecessor  of  m  in  the  order,  can  be  eliminated.  We  now  have 
an  ordering  of  links  which  is  descending  in  aj  and  ascending  in  bj.  We 
then  compute,  for  each  adjacent  pair  of  links  1  and  m,  the  value  for 
which  aj  +  bjXj  =  The  \j  should  form  an  ascending  se¬ 

quence.  A  value  of  \j  which  is  less  than  the  value  of  its  predecessor 
in  the  sequence  corresponds  to  a  link  1  which  can  be  eliminated  from 
further  consideration.  In  this  case,  link  1  is  eliminated  and  m  has  a 
new  predecessor.  The  \  is  then  recomputed  for  link  p  the  new  pre- 

decessor  of  m  and  this  process  is  repeated.  For  link  n,  the  last  link 

in  the  sequence,  =  1- 

We  now  compute  for  each  remaining  link  1  Vj  =  aj  +  bjXj  and  select 
the  minimum  of  these  values.  The  resulting  \j  and  Vj  are  the  desired 
values  yielding  the  optimal  superposition. 

This  entire  process  is  illustrated  in  Figure  2.  The  links  have 
been  sorted  so  that  the  a^  are  descending.  Links  with  nonascending  bj 
have  already  been  eliminated.  Thus  the  bj  form  an  ascending  sequence. 
This  is  evident  in  Figure  2  by  the  fact  that  the  lines  form  a  sequence 
increasing  in  slope.  The  intersection  of  lines  1  and  2  (i.e.  the  lines 
starting  at  a^  and  a2)  defines  Similarly,  the  intersection  of  lines 
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2  and  3  defines  and  ^  ^  So  thus  far  no  line  is  dominated.  The 
intersection  of  lines  3  and  4  takes  place  between  \  =  1  so  line  4  is 
dominated  by  line  3  and  line  4  is  thus  eliminated  from  further  consider¬ 
ation.  The  intersection  of  lines  3  and  5  defines  a  value  of  (dotted 
line),  but  when  is  computed  we  find  it  to  be  less  than  So,  line 
5  is  dominated  by  line  6  and  removed  from  further  consideration,  k^  is 
then  recomputed  from  the  intersection  of  lines  3  and  6.  ■^g  is  computed 
from  the  intersection  of  lines  6  and  7.  Finally  =  1.  This  leaves  us 
with  X^,  X2,  X^,  Xg,  and  kj  (also  Xq  =  0).  We  search  among  the  cor¬ 
responding  v^  and  find  v^  is  minimum.  It  and  X^  define  the  desired 
superposition. 

The  entire  optimization  process  is  illustrated  in  Figures  3,  4,  and 
5.  The  network  consisting  of  3  nodes,  6  links  and  6  requirements  is 
shown  in  Figure  3.  For  simplicity,  we  assume  symmetric  requirements 
and  link  capacities .  We  can  thus  assume  a  symmetric  solution ,  i .  e . , 
routes  for  r..  the  reverse  of  routes  for  rj.  and  equal  utilization  of  each 
link  in  both  directions.  This  allows  us  to  only  consider  3  links  and  3 
requirements  in  the  example.  This  is  done  to  simplify  the  example. 
The  actual  procedure  works  with  directed  links  and  requirements.  It 
can  also  be  used  with  undirected  links  and  requirements  but  such  a 
situation  is  rarely  physically  meaningful. 

Initially,  all  requirements  are  routed  directly  since  the  initial 
shortest  paths  by  our  definition  would  be  the  paths  with  the  minimum 
number  of  links.  This  is  illustrated  in  Figure  4a.  The  link  lengths 
are  then  recomputed  --  link  1  has  a  utilization  of  .3  and  hence  a  length 
of  .3,  etc.  The  shortest  paths  are  then  recomputed  and  are  shown  in 
Figure  4b.  Note  that  the  shortest  path  from  B  to  C  is  now  B-A-C. 
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The  requirements  are  loaded  onto  these  paths.  The  flow  pattern  is 
shown  in  Figure  4c. 

Now  a  superposition  of  the  flow  patterns  in  Figures  4a  and  4c  is 
done.  Figure  5  illustrates  the  dynamics  of  this.  Note  that  r^^  domi¬ 
nates  r^g  and  that  the  optimal  X.  is  .9  and  v  =  4.5.  Figure  4d  shows 
the  resultant  routing  pattern  formed  by  using  the  first  routes  for  90% 
of  the  traffic  and  the  second  routes  for  the  remaining  10%.  Figure  4e 
shows  the  flow  pattern  resulting  from  the  superposition.  Note  that  the 
maximum  utilization  is  .45  (in  links  (A.C)  and  (B,C))  which  i  'ss 
than  the  maximum  in  either  of  the  patterns  in  Figures  4a  and  4c. 

We  now  recompute  the  link  lengths  and  the  shortest  paths  'he 
resultant  routes  are  the  same  as  in  Figure  4a.  An  optimal  superpt  .aon 
between  this  flow  pattern  and  the  one  in  Figure  4e  is  then  done.  The 
optimal  value  of  \  is  1,  no  improvement  in  v  is  found  and  we  conclude 
that  the  routing  and  flow  pattern  in  Figures  4d  and  4e  are  optimal. 
Note  that  the  links  (A,C)  and  (B,C)  are  both  maximally  utilized.  They 
form  a  cut  which  is  analogous  to  the  saturated  cut  in  Gerla's  Cut  Satu¬ 
ration  Method.  (The  existence  of  such  a  cut  is  a  necessary  condition 
for  the  optimality  of  a  flow  pattern.) 

We  thus  have  developed  a  simple  and  efficient  algorithm  for  obtain¬ 
ing  stable  flow  patterns  for  use  globally  as  the  higher  level  in  our  2 
level  adaptive  routing  procedure.  In  the  coming  months  we  hope  to 
implement  this  procedure  and  experiment  with  it. 

In  an  allied  study  we  investigated  a  pattern  for  placing  virtual 
calls  on  a  network.  A  simulation  program  was  written  to  directly  ob¬ 
serve  the  dynamic  performance  of  an  algorithm  which  loads  calls  on 
alternate  routes  according  to  the  following  algorithm: 
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1.  Load  each  incoming  call  onto  the  route  currently  carrying  the 
smallest  number  of  calls.  (The  number  of  calls  carried  by  a 
route  is  defined  for  the  purposes  of  this  algorithm  to  be  the 
number  of  calls  on  the  first  link  in  the  route.) 

2.  If  there  is  a  tie  among  several  routes  in  a  set,  S,  select 
route  i  with  probability  P-(S). 

The  simulation  was  written  to  provide  us  with  a  first  glimpse  of 
the  dynamic  performance  of  such  a  procedure  as  a  guide  for  further 
research  in  this  area.  We  thus  wanted  to  keep  it  as  simple  as  possible 
and  considered  a  3  node  6  link  network  as  shown  in  Figure  5a  with 
symmetric  requirements.  The  program  can  easily  be  expanded  to  con¬ 
sider  more  general  cases  but  we  chose  this  simple  one  initially  in  order 
not  to  obscure  the  basic  results. 

Calls  arrive  at  each  node  at  a  rate  \  (Poisson)  and  are  served  at 
rate  p  (exponential)  by  the  links,  i.e.,  have  exponential  duration  with 
average  length  1/p.  Each  call  has  a  choice  of  a  1  hop  path  or  a  2  hop 
path.  A  call  arriving  at  a  node  is  equally  likely  to  be  destined  for 
either  other  node.  Thus,  there  is  total  symmetry  in  the  system.  It 
should  be  noted  that  a  call  taking  a  2  hop  path  occupies  2  links  but 
remains  in  the  system  for  time  1/p  on  average  (not  2/p). 

The  simulation  is  straightforward.  Call  arrivals  are  generated 
randomly  and  arriving  calls  are  routed  accord’ng  to  the  algorithm  given 
above.  The  number  of  calls  taking  the  1-hop  and  2-hop  routes  were 
recorded  for  each  run.  A  parameter,  a,  determined  the  probability  of 
taking  the  1-hop  route  when  there  was  a  tie  between  the  2  routes  (a  = 
Prob  {using  the  1-hop  route  in  case  of  a  tie}). 
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For  (1  =  .5  the  fraction  of  calls  taking  the  1-hop  route  was,  not 
surprisingly,  very  close  to  For  or  =  0,  however,  the  fraction  varied. 
For  A,/p  =  1,  80%  of  the  calls  took  the  2-hop  path.  For  \/p  =  10,  66%  of 
the  calls  took  the  2-hop  path.  For  \/p  =  50,  65%  of  the  calls  took  the 
2-hop  path.  For  a  =  1,  the  results  (fraction  on  1-hop  versus  fraction 
on  2-hop  paths)  reversed  relative  to  the  results  for  a  =  0. 

We  thus  conclude  that  we  have  some  control  but  not  total  control 
over  the  routing  via  a  which  only  operates  during  a  tie.  The  control 
gets  greater  for  systems  with  a  smaller  number  of  calls  in  progress,  as 
evidenced  by  the  results  for  smaller  values  of  \/p  (which  is  directly 
related  to  the  number  of  calls  in  the  system).  Observations  of  the 
number  of  calls  in  the  system  at  various  points  in  a  simulation  run  led 
to  the  conclusion  that  the  system  is  stable,  i.e.,  that  the  link  loads 
reach  a  stable  level  and  remain  close  to  that  point  and  close  to  one 
another. 


FIGURE  1 


OPTIMIZATION  PROCEDURE 


FIGURE  2 

FLOW  SUPERPOSITIONS 
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FIGURE  3 

NETWORK  AND  REQUIREMENTS 


FIGURE  4a  FIGURE  4b  FIGURE  4c 

Routing  Pattern  1  Routing  Pattern  2  Flow  Pattern  2 

=  Flow  Pattern  2 


FIGURE  4d  FIGURE  4e 

Superposition  Superposition 

of  Routing  of  Flows 

Patterns 
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A. 2  Locally  Adaptive  Routing 

For  routing  at  the  local  level  we  proposed  a  priority  routing  al¬ 
gorithm  [1].  Its  function  is  summarized  in  Figure  6  whereby  dedicated 
traffic,  i.e.  those  without  choice,  join  queues  and  Q2  in  order  to 
obtain  service  on  links  A  and  B  respectively.  Traffic  with  choice  join 
the  middle  queue  and  is  served  on  either  A  or  B  whenever  or  Q2 
is  empt^'.  Assuming  arrival  rates  equal  service  times  p,  the 

service  utilizations  and  pg  are  given  in  the  Figure.  The  parameter 
a,  denotes  the  fraction  of  the  non-dedicated  traffic  which  is  served  by 
A.  A  heuristic  estimate  for  a  is  given  by 


Solving  for  a  we  obtain 


2-P1-P2 


This  assumes  that  packets  from  join  A  in  proportion  to  the  proba¬ 
bility  that  is  empty.  It  has  been  shown  that,  by  serving  the  non- 
dedicated  traffic  in  idle  periods  of  Q,  and  Q-,  the  system  achieves 
close  to  two-server  (M/M/2)  behavior,  thus  reducing  the  packet  delay 
(averaged  over  the  three  queues)  by  a  factor  approaching  2  when 
compared  to  two  independent  M/M/1  queues.  The  latter  models  the 
random  bifurcation  with  the  non-dedicated  traffic  joining  the  ends  of 
queues  and  Q2  according  to  a  random  rule.  The  introduction  of  the 
middle  queue  is  the  essential  key  to  improving  delays. 

An  additional  form  of  control  exists  in  our  priority  rule,  and  may 
be  used  to  achieve  certain  values  of  p^  and  pg  without  destroying  the 
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multiserver  behavior.  This  is  possible,  by  taking  advantage  of  non- 
dedicated  arrivals  to  a  totally  empty  system.  Then  both  servers  are 
idle,  and  the  packet  c  n  be  i^ent  to  A  >  r  B  with  preset  probabilities  ^ 
and  1  "  p,  respectively. 

Let  Pj.  =  Prob.  {Both  servers  are  idle} 

Then  Pr  {only  A  is  idle}  '  ^ 

Pr  {only  B  is  idle}  “  ^  ■  Pg  ”  P£ 

Pr  {at  least  one  server  is  idle}  =  2(1  -  p)  -  P^. 

The  fraction  of  non-dedicated  traffic  which  joins  A  can  be  modified 
to 


1  -  p.  -P.  +pP„  1  -  p,  -  Pp(l  -  p) 

2(1 ---P)  -  f/  -  2-^-T2-  V 


(Our  previous  results,  apparently  ignoring  p,  actually  assumed  p  » 

1  -  p2/2-Pj^-P2.) 

Since  Pj.  cannot  be  evaluated  in  a  closed  form,  we  obtain  bounds. 
An  upper  bound  is  the  M/M/2  probability  of  an  empty  system.  This  is 
true  since  such  a  service  mechanism  assumes  that  all 
can  use  either  server  and  thus  achieves  the  best  utilization.  A  lower 
bound  is  for  an  M/M/1  queue.  Thus 


K2,  ^3  traffic 


(1  -  p^)(l  -  P3)  <  Pg  <  . 


A  looser  but  more  usable  lower  bound  is  1  -  2p. 

As  an  example  let  p  =  Then  S  1  P£  1  3/5.  If  P^  =  P2  =  P/2, 

then  for  p  =  1,  a  >  7/10  and  for  p  =  0,  a  <  3/10.  The  range  of  control 
using  p  is  at  least  40%  of  the  total. 
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(Numerical  results  to  come) 
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The  interaction  between  the  global  quasi-static  routing  and  the 
locally  adaptive  strategy  lies  in  determining  average  link  flows  at  the 
high  level  and  designing  the  local  policy  in  order  to  achieve  them.  In 
terms  of  the  node  model  of  Figure  7,  the  local  problem  can  be  formu¬ 
lated  as  follows: 

Given  the  input  traffic 
and  the  desirable  output  traffic  Xg,  where  ^  ^  ^  ^Xg, 

minimize  the  average  queuing  delay.  In  the  preceding  section,  we 
demonstrated  how  we  can  control  X^  to  a  certain  degree  using  our 
priority  scheme  and  adjusting  p  (the  probability  of  routing  a  packet 
for  A  when  both  servers  are  idle).  The  impact  of  p  can  be  significant 
especially  under  low  utilizations,  but  cannot  always  achieve  the  whole 
spectrum  of  X^  between  X 
extreme  values,  the  non-dedicated  traffic  becomes  fully  dedicated  and 
delays  are  that  of  two  independent  M/M/1  queues.  At  values  of  X^  in 
the  middle  of  the  allowable  range,  variations  of  p  will  provide  the 
desirable  values  with  delays  close  to  the  M/M/2  lower  bound.  At  the 
extremes,  we  may  have  to  violate  the  priority  scheme  and  either  defer 
service  of  a  non-dedicated  packet  in  order  to  control  its  flow,  or  occa¬ 
sionally  break  the  priority  rule  in  order  to  serve  a  non-dedicated 
packet.  Our  objective  is  still  to  be  able  to  fill-in  idle  periods  of 
Q2  with  non-dedicated  packets  as  much  as  possible  in  order  to  minimize 
delays.  From  a  preliminary  investigation  of  the  problem,  we  arrived  at 
the  following  observations: 

A.  No  action  need  be  taken  while  the  system  remains  in  a  state. 
Actions  may  have  to  be  taken  at  transition  times  of  the  state 
vector.  By  actions  we  mean  deferment  of  service  or  breaking 


^  and  Xj^  +  X^.  As  shown  in  Figure  7,  at  the 


X2  (dedicated)  and  X^  (n'tn-dedicated) 
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of  priority.  By  transition  times  we  mean  any  arrival  or 
departure  time  in  any  of  the  queues  and  servers  of  the 
system.  This  observation  follows  from  the  Markov  nature  of 
the  system. 

B.  A  countable  number  of  points  in  the  optimal  delay  vs. 

curve  can  be  obtained  via  deterministic  policies ,  whereby 
actions  are  assigned  to  states  with  probability  one.  This 
follows  from  formulating  the  truncated  state  space  problem  as 
a  finite  state  Markov  Decision  process  with  average  cost 
minimization  under  a  state  frequency  constraint  (the  specified 
value  of  X^).  It  is  known  that  such  problems  lead  to  proba¬ 
bilistic  (randomized)  action-state  assignments  in  general  [3]. 
However  by  incorporating  the  constraint  within  the  objective 
function  (average  delay)  using  a  Lagrange  multiplier,  the 
problem  becomes  an  unconstrained  optimization  problem,  which 
leads  to  deterministic  policies  [4] .  Due  to  the  discrete  finite 
state  and  action  spaces,  only  a  finite  number  of  optimal  points 
can  be  found  using  Lagrange  multipliers. 

A  simplified  version  of  the  node  optimization  problem  can  be  ob¬ 
tained  by  deleting  the  dedicated  queues  Q, ,  Q^.  Then  the  problem  can 
be  formulated  as  an  optimal  control  of  output  flows  in  an  M/M/2  queue 
as  in  Figure  8.  The  same  arguments  mentioned  above  indicate  the 
nature  of  the  optimal  policy  as  shown  in  the  figure.  We  do  have  strong 
indications  that  the  deterministic  policies  are  simple  threshold  schemes, 
whereby  packets  defer  using  server  B  (if  X^  >  Xg)  unless  the  number 
of  packets  waiting  in  queue  Q  exceeds  a  threshold  K.  Values  of  X^ 
in-between  two  threshold  policies  can  be  obtained  by  implementing  a 
random  choice  at  the  threshold. 
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OPTIMUM  DELAY  IN  M/M/2  QUEUES  WITH  SPECIFIED  K 


B.  Multiple  Access  Techniques  with  Arbitrary 

Packet  Length  Distributions 
B.l  Introduction 

In  our  original  multihop  packet  radio  analysis,  we  assumed  expo¬ 
nentially  distributed  packet  lengths.  We  have  been  able  to  generalize 
the  analysis  for  packet  lengths  having  densities  formed  by  the  positive 
sum  of  exponential  terms  (see  Appendix  C).  In  our  analysis  we  as¬ 
sumed  that  propagation  delays  among  neighboring  PRU's  are  negligible. 
Thus  in  a  Carrier  Serving  Multiple  Access  (CSMA)  mode  of  operation, 
collisions  may  occur  due  to  the  "hidden  terminal"  phenomenon  only 
(i.e.,  two  non-communicating  PRU's  schedule  packet  transmissions  to  a 
common  neighbor  simultaneously). 

CSMA  analyses  incorporating  the  effects  of  propagation  delays  have 
been  reported  extensively  in  the  literature  for  single-hop  networks 
(i.e.,  all  PRU's  hear  each  other)  and  fixed  packet  sites.  As  a  first 
step  in  generalizing  these  results,  we  studied  single-hop  multiple  access 
protocols  with  non-fixed  packet  lengths.  Although  our  main  thrust  is 
on  CSMA  packet  radio,  we  also  derived  formulas  for  pure  ALOHA  and 
CSMA  with  collision  detection  (CSMA/CD).  The  former  was  a  necessary 
step  in  order  to  demonstrate  the  impact  of  packet  length  distribution  on 
the  simplest  multiple  access  method,  whereas  the  latter  is  a  straight¬ 
forward  extension  of  pure  CSMA  and  is  especially  popular  in  local 
networking  environments.  Note  that  the  pure  ALOHA  case  was  studied 
previously  [5],  whereas  no  extension  has  been  reported  on  CSMA  to  our 
knowledge.  The  CSMA/CD  result  is  so  simple  that  it  may  already  be 
known . 
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In  what  follows,  we  summarize  the  variable  packet  length  analyses 
in  pure  ALOHA,  CSMA  and  CSMA/CD.  In  all  cases  we  assumed  infi¬ 
nitely  many  Poisson  sources  and  Poisson  aggregate  scheduling  pro¬ 
cesses,  with  rates  s  and  g  packets/sec  respectively.  Packet  lengths 
are  distributed  arbitrarily. 

B.2  Pure  ALOHA 

Referring  to  Figure  9,  we  consider  a  transmission  of  length  Y 
(shaded).  This  transmission  will  be  successful  if  a)  no  other  packet  is 
transmitted  in  Y  seconds  and  b)  no  previously  transmitted  packet  is 
still  transmitting.  We  are  assuming  zero  capture.  Calling  these  prob¬ 
abilities  P^  and  Pj^,  we  obtain  s  =  But 

00 

P^  =  J  e'^^  fY(y)  dy  =  MY(-g) 

^  0 

where 

00 

MY(g)  =  f  e^  fY(y)dy 
0 

is  the  moment  generating  function  of  Y  and  fY(y)  is  its  density.  P^  is 
found  by  considering  the  T  second  interval  prior  to  the  transmission  in 
question.  Assume  transmissions  in  that  interval  occur  T-  seconds 
before  the  start  of  our  test  transmission  and  have  length  Yj.  Then 

P.  =  lim  P(all  T.  >  Y.) 

But  the  number  of  transmissions  in  T  is  Poisson  and  all  are  identically 
distributed  and  independent.  Therefore 
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P(all  T;  >  Y.)  =  1  [P(T 
^  ^  k=0 

-gT[l-P(Tj  >  Y.)]  -gTP(Tj  <  Y.) 

=  e  =  e 

Here  Tj  is  uniform  in  the  interval  (0,T)  and  Y.  is  distributed  as  Y. 
They  are  independent.  Thus 

1 

P(Tj  <  vp  =  i  ;  [1  -  FyCt)]  dt 

and 

OD 

TP(T.  <  Y.)  -  ;  [1  -  F„(t)]  dt  =  E(Y) 

1  i  0  ^ 

Here  FyCy)  is  the  distribution  function  of  Y  and  E(Y)  its  expectation. 
Finally  we  have 

s  =  gMyC-g)  . 

Note  that  if  Y  is  fixed  then  s  =  ge~^^  as  it  should. 

But  Y  is  the  length  of  the  transmitted  packets.  Condition  (b) 
above  does  not  involve  the  length  of  the  transmitted  packet  Y.  But 
condition  (b)  does!  Longer  packets  are  more  likely  to  suffer  a  colli¬ 
sion.  Let  X  be  the  length  of  the  offered  (or  successful)  packets.  Y 
should  in  a  sense  be  larger  since  longer  packets  are  retransmitted  more 
often.  Due  to  condition  (a)  alone  a  packet  of  length  x  will  be  suc¬ 
cessfully  transmitted  with  probability  e”^^  and  requires  an  average  of 
e^  transmissions  to  be  successful.  Thus 

fyCy)  =  fx(y)/Mj^(g). 
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Also 


dM^(g) 


and  =  1/Mj^(g)  . 


Thus 


s  = 


q 

m;^ 


-gM^,(g)/M^(g) 


This  also  reduces  to  s  =  ge”^^^  when  M^(g)  =  e^^.  This  result  has 
been  already  established  [5]  but  is  derived  here  in  a  different  manner. 


B.3  CSMA 

Refer  to  Figure  10  for  CSMA.  After  an  idle  period  a  packet 
scheduled  with  rate  g  is  transmitted.  The  packet  lasts  for  X  seconds. 
In  the  propagation  time  a  after  transmission  any  other  scheduled  packet 
can  also  be  transmitted  thus  causing  a  collision.  We  assume  X  >  a. 
Again  we  assume  zero  capture.  If  no  such  packet  is  transmitted,  then 
the  original  transmission  is  successful,  lasts  for  X  seconds,  and  is 
followed  by  an  a  second  period  to  clear  the  channel  and  the  idle  state 
resumes.  Thus  the  successful  rate  is 


s  =  ge'^^  P(channel  is  idle). 


But 


PCchannel  is  idle)  =  i/g  E(Z) 


where  1/g  is  the  average  idle  time  and  Z  is  the  busy  period  exclusive 
of  the  last  a  seconds. 

To  evaluate  E(Z)  we  denote  the  times  of  the  transmissions  of 
interfering  packets  as  Tj  and  their  lengths  as  Xj.  The  number  of  such 
packets  is  exponential.  Collisions  depend  only  upon  the  propagation 
time  a  and  not  on  the  length  of  the  transmitted  packet  as  in  ALOHA. 
Thus 
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Z  =  max  {X.  T.  +  Xj}. 

and  F^Cz)  =  F^(2)  Z  [F^(2)1^^^  e‘9^ 

Ic  0 

where  W  =  T.  +  X.,  Tj  is  uniform  in  (0,a)  and  is  independent  of  Xj, 
which  is  distributed  as  X.  Thus 


and 


F2(z)  =  ® 


-ga(l  -  F^.(2)] 


E(Z)  =  /  11  -  F2(2)]  d2. 
0  ^ 


The  last  two  equations  can  be  used  to  find  E(Z),  although  not  easily. 
Finally 

c  -  qe  ^ 

®  ■  1  r  ga  +  gE(Z)  * 


B.4  CSMA/CD 

Refer  to  Figure  11  for  CSMA/CD  (unslotted).  Here  collisions  are 
detected  and  transmissions  aborted.  Again  packets  are  scheduled  with 
a  rate  g.  After  an  idle  period  a  packet  is  transmitted.  If  no  packets 
are  transmitted  in  the  next  a  seconds  that  packet  is  successful.  After 
a  seconds  to  clear,  the  channel  returns  to  idle.  A  scheduled  packet 
can  be  transmitted  in  the  first  a  seconds  and  will  cause  a  collision. 
But  it  will  be  aborted  a  seconds  after  the  original  transmission.  The 
original  transmission  will  be  aborted  a  seconds  after  the  start  of  the 
colliding  packet.  Thus,  as  before. 


28 


s  =  ge'^^  P(channel  idle). 

PCchannel  idle)  =  i^g  v  e^T) 

and  2  =  X,  with  prob.  e"^^ 

min{T.}  +  a,  with  prob.  1  -  e‘®^. 

Here  we  assume,  for  convenience,  that  X  >  2a.  Solving  we  get 

E(Z)  =  E(X)e’3^  +  I  +  a) 

k“l 

=  E(X)e"9^  I  ■  (l-^ga)e‘9^]  +  a(l  -  e"^^) 


1  +  gE(X)e'9^  +  (l+2ga)(l-e’9^) 


Note  than  only  E(X)  appears  in  the  above  equation. 
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FIGURE  9 
PURE  ALOHA 
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ABSTRACT 

We  consider  the  problem  of  obtaining  exact  expressions  for  throughput  and  blocking 
probabilities  in  multihop  packet  ratio  networks  operating  under  CSMA.  We  obtain  exact  results 
for  a  general  class  of  message  lengths,  for  general  topologiet,  and  for  perfect  capture.  These 
results  are  obtained  by  assuming  perfect  acknowledgments. 


I.  INTRODUCTION 

We  consider  the  problem  of  obtaining  exact  expressions  for  throughput  and  blocking 
probabilities  in  multihop  packet  radio  networks  operating  under  earner  sense  multiple  access 
(CSMA).  Procedures  are  developed  which  can  be  used  to  analyze  general  topoiogies  for  a 
general  class  of  packet  length  listribuuon.  Hxampies  of  chains,  rings,  and  stars  are  presented. 

II.  THE  NETWORK  MODEL 

We  consider  the  problem  of  analyzing  the  throughput  capability  of  a  multihop  packet 
radio  network  operating  under  carrier  sense  multiple  access  (CSMA).  Thus,  we  assume  that 
the  network  is  comprised  of  terminais  equipped  with  radio  transponders  suiuble  .''or 
broadcasting  data  over  a  limited  distance.  In  general,  the  source  and  destination  termi.nais 
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caanot  bear  each  other  directJy,  and  the  dau  has  to  be  relayed  by  one  or  more  intermediate 
devices.  A  separate  set  at  devices,  called  repeaters,  may  exist  for  this  purpose,  or  the  terminais 
themselves  may  relay  messages  for  one  another. 

Control  of  the  network  is  completely  distributed,  i.e.,  no  sution  or  central  control 
mechanism  is  assumed  to  exisL  Rather,  we  assume  that  each  source  terminal  has  prestored 
one  or  more  routes  to  ail  desdnadons  and  includes  all  necessary  roudng  informadon  in  the 
packets  if  transmits.  These  assumpdons  are  made  merely  to  simplify  the  presectadon.  In  fact, 
the  results  presented  are  valid  for  networks  using  alternate  roudng  as  long  as  roudng  changes 
are  not  made  over  short  dme  intervals.  One  of  the  modvadons  for  this  study  came  from  a 
consideradon  of  the  design  of  roudng  procedures  for  such  networks.  It  was  necessary, 
however,  to  first  develop  an  understanding  for  the  throughput  of  various  topologies. 

Exogenous  traffic  is  modeled  as  independent  Poisson  processes  arriving  at  each 
source  node,  with  appropriate  rates  and  packet  lengths.  The  topology  is  specified  by  a  listing  of 
which  terminals  (or  repeaters)  can  hear  each  other.  In  the  remainder  we  will  .not  distinguish 
between  terminals  and  repeaters  and  will  refer  to  them  collectively  as  either  terminais  or  nodes. 
In  general  the  transmissions  of  one  terminal  can  be  heard  by  many  other  terminals.  The 
roudng  will  specify  which  terminai  is  to  repeat  the  packet,  if  necessary. 

If  two  or  more  transmissions  are  simultaneously  heard  by  a  terminal  (called  a 
’collision*)  at  least  one,  and  possibly  both,  is  ’lost*  and  must  be  retransmitted.  We  assume 
retransmissions  are  scheduled  at  a  random  insunt  in  dme  suSciendy  far  in  the  future  so  as  to 
preserve  the  Poisson  nature  of  the  combined  traffic  stream,  which  now  consists  of  exogenous 
traffic  and  rescheduled  traffic.  For  this  study,  we  assume  that  a  packet  can  be  retransmitted  as 
many  times  as  is  necessary,  i.e.,  that  there  is  no  maximum  allowable  number  of 
retransmissions. 

At  any  time,  terminals  may  either  transmit  or  receive,  they  cannot  do  both 
simuluneousiy.  Before  transmitting,  a  terminal  senses  the  channel.  If  it  detKts  that  any  of  its 


r 
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neighbon  (i.e.,  teminals  that  it  can  hear)  are  tnnsmittiag  (by.  e.g.,  sensing  a  earner)  it 
rescheduies  the  tranamisaion  aa  for  collided  pickets  above.  If  at  the  scheduled  time  for  a 
tnnamiaaion.  the  tenninai  is  already  engaged  in  transmitting  a  packet,  the  new  packet  is  also 
rescheduled  aa  above.  Thus  packets  are  continually  rescheduled  until  they  are  successfully 
delivered  to  the  next  terminal  on  their  route.  We  assume  that  the  total  stream  of  traffic 
scheduled  by  any  tenninai  is  a  Poisson  process.  This  includes  originating  traffic  and  packets 
rescheduled  either  due  to  collisions  or  due  to  the  channel  having  been  sensed  busy.  This 
scheme  is  called  carrier  sense  multiple  access  (CSMA).  The  Poisson  assumpdon  is  valid  for  the 
assumptions  made  above  and  will  yield  accurate  results  for  throughput  Compromises  will  have 
to  be  made,  however,  if  an  aocurate  picture  of  time  delays  is  to  be  considered. 

It  is  possible,  due  to  non-zero  propagation  delay,  that  collisions  of  transmissions 
from  neighboring  terminals  may  still  take  place  despite  the  CSMA  strategy.  This  will  occur  if  a 
terminal  senses  the  channel  before  another  terminal’s  transmission  is  received.  This  effect  is 
small  if  terminals  are  reasonably  close  or  are  not  transmitting  at  high  speed.  We  wiU  ignore  this 
phenomena  here,  and  assume  that  ail  transmissions  are  instantly  heard  by  their  neighbors. 

A  passive  acknowledgment  is  used  for  transmission  to  neighbors.  The  transmitting 
terminal  listens  to  the  channel  to  hear  if  a  packet  is  being  rebroadcast  by  a  neighbor.  If  after  a 
prespecified  time  interval,  the  transmitting  node  does  not  hear  the  packet  rebroadcast,  it 
retransmits  the  packet.  But  the  packet  may  have  been  successfully  received  by  the  neighbor 
even  though  the  originator  does  not  hear  the  rebroadcast.  Duplicate  packets  may  be 
transmitted  and  deleted  only  at  the  final  destination  or  they  may  be  detected  and  deleted  earlier. 
An  end-to-end  acknowledgment  is  returned  to  the  originator  from  the  final  destination.  In  this 
paper  we  assume  that  passive  acknowledgments  are  always  heard  ->*  ignore  the  effect  of  end- 
to-end  acknowledgments.  Alternately,  these  acknowledgements  could  .have  been  added  to  the 
required  traffic.  (In  a  sequel  paper,  the  effect  of  passive  acknowledgements  will  be  studied). 


We  depict  the  topology  of  the  network  by  a  graph  where  terminals  are  represented 
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by  nodes.  Tbe  nodes  are  connected  by  a  link  if  they  can  hear  each  other’s  tnnsmissions.  i.e..  if 
they  are  neighbors.  As  an  example  see  Figure  1.  Node  .A  can  bear  node  B.  but  not  node  C. 
Node  3  can  hear  both  nodes  A  and  C  Node  C  can  hear  node  3.  but  not  node  A.  If  node  .A  is 
transmitting  to  node  B  and  node  C  begins  transmitting,  then  the  transmission  from  A  to  3  may 
be  lost  depending  upon  the  'capture'  assumptions  we  make.  A  conservative  assumption  is  that 
the  A  to  B  transmission  is  lost  •  this  is  known  as  zero  capture,  .alternatively,  perfect  capture 
assumes  that  this  transmission  is  successfully  received.  Half*ampiitude  capture  assumes  that 
the  transmission  is  lost  if  C  dominates  A  at  3.  This  can  happen  if  C  is  closer  to  3  than  A  is  to 
3,  or  has  a  greater  signal  strength  perceived  at  3  than  A  has.  If  A  dominates  C.  then  the 
transmission  is  successfuL  However,  in  all  cases  of  a  collision  we  assume  the  later  transmission 
is  lost.  Thus  if  C  is  transmitting  to  3,  this  packet  is  lost  in  all  cases.  We  will  consider  only 
perfect  capture  sittudons  below.  Note  that  under  CSMA  if  node  3  is  transmitting,  neither  A  or 
C  is  allowed  to  transmit. 

We  assume  that  a  routing  has  been  speched.  This  takes  the  form  of  deciding  which 
of  the  neighbors  are  to  rebroadcast  a  packet  from  a  particular  source  to  a  particular  destination. 
Thus  the  amount  of  traffic  that  a  terminal  '«ishes  to  send  to  its  neighbor  can  be  computed.  If 
these  rebroadcast  packets  are  scheduled  at  a  random  time  far  in  the  future  the  Poisson 
assumption  for  traffic  streams  is  preserved.  We  assume  that  the  traffic  between  neighbors  is 
specified  and  form  independent  Poisson  processes.  We  assume  that  the  packet  length  is 
reassigned  independently  at  each  hop.  This  is  analogous  to  the  'independence  assumption’  in 
queuing  networks. 

The  details  of  CSMA  for  a  single  hop  network  can  be  found  in  the  papers  by  Tobagi 
and  Kleinrock^^V  Tobagi  has  also  developed  some  simple  models  for  two-hop  networks^"^ 
Details  of  a  packet  radio  network  can  be  found  in  a  paper  by  Kahn^^\  A  discussion  of  routing 
in  multihop  packet  radio  can  'oe  found  in  the  paper  by  Oilman,  Van  Slyke,  and  Frank^‘*\  .\n 
earlier  version  of  this  paper  was  presented  at  ICC’80(5).  More  details  can  be  found  in  the 
thesis  of  Sahin(6). 


in.  GE.NERAL  RESULTS 


la  this  SMtion  we  develop  some  expressions  that  are  valid  for  the  packet  radio 
network  we  have  modeled  above  using  CSMA  and  with  aa  arbitrary  packet  length  distribution. 
Let  i  be  a  node,  it,  oae  of  its  neighbors,  y'  the  set  of  ail  the  neighbors  of  i,  aad  Nt  the  set  of 
all  i's  aesghbon,  iadudiag  L  Let  be  the  total  rate  (in  packets/sec)  of  all  scheduled  traffic  at 
node  i.  This  includes  originating  traffic  and  all  rescheduled  traffic  and  is  assumed  to  be  Poisson. 
Let  l/ni  be  the  average  length  of  packets  transmitted  by  node  L  Let  Ci  be  a 

nonnaiised  rata. 

Node  i  is  either  busy  (transmittiag}  or  idle.  It  wiS  transmit  a  scheduled  packet  if  at 
the  instant  it  is  scheduled  ail  nodes  in  y,  are  idle.  Let  A  be  a  set  of  nodes.  Let  P(A)  be  the 
probability  that  at  a  random  instant  all  nodes  in  A  are  idle.  The  nodes  not  in  A  nuy  or  ouy 
not  be  idle.  Similarly  Pfil,  P<i,A),  P(A.B)  are  the  probabilities  that  i  is  idle,  node  i  and  nodes 
in  A  are  idle,  and  all  nodes  in  A  aad  B  are  idle. 

Since  traffic  is  scheduled  at  node  i  with  a  Poisson  rate  £ ,  will  be  transmitted  only  if 
iV(  is  idle,  and  transmissions  have  average  length  1/mi,  the  probability  that  i  is  busy  is  given  by 

(1) 

If  i  is  busy  then  under  CSMA,  R|  must  be  idle.  Then  since 

P(n,)  +?(«,!/  htuy)(l-P(i)] 

aadP(/i(|/  iusy)  —  1,  we  have 


P(/i,./)-P(/i,)*P(0-l 


Similarly,  if  C  .V/*, 


C) 
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P{A.i)~P(A)^P{i)  -  I 
letting  A  ■■  /V*  in  equation  (3),  and  using 

P{N:)~P{N^/P{i\N:) 


(3) 


we  get 


Equation  (4)  is  often  found  in  CSMA  literature. 

\  packet  from  i  to  /if  will  be  nnsmitted  when  it  is  scheduled  if  Si  is  idle.  During 
the  transmission  all  nodes  in  S‘  will  be  idle.  It  will  be  successfully  received  at  /if  if  ail 
neighbors  of  /if  not  in  St  are  also  idle  at  the  beginning  of  transmission.  Otherwise  a  collision 
will  occur.  Let  Stj^  be  the  rate  in  pachets/sec,  determined  by  the  routing  and  assumed  Poisson, 
of  the  traffic  that  i  wishes  to  send  to  /if.  This  is  the  required  throughput  or  offered  traffic.  Let 
be  the  rate  of  ail  scheduled  traffic  from  i  to  /if.  We  have  also  assumed  that  all  these 
streams  are  Poisson  and  independent.  Of  these  gtj^  packets  per  second,  if  ^  must  be 
successful.  Thus 


fOVf^V,,) 


where  5,.;^  “•f/V'*'  ^'.-t  • 

The  total  scheduled  traffic  (normalized)  at  a  .node  is  given  by 


Ot 


2 

»,./v,* 


(5) 


(6) 


From  equations  (1)  through  (6)  we  wish  to  derive  a  relation  between  the  Stj,^  and  Gt,  and 
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detennine  the  maximum  Si^,^  the  network  can  support.  This  we  call  the  (maximum) 
throughput  or  capacity.  Xu  the  next  section  we  develop  this  relationship  for  exponential  packet 
lengths  and  arbitrary  topologies.  Later  we  extend  the  analysis  to  a  general  class  of  packet 
length  distributions. 

IV.  EXPONENTIALLY  DISTXIBirrcD  PAOLET  LENGTHS 

If  the  packet  lengths  are  exponentially  distributed,  then  the  system  can  be  viewed  as 
a  Markov  process  where  the  states  are  identified  by  which  nodes  are  idle  and  which  are  busy. 
Let  D  be  a  set  of  busy  nodes.  Because  of  CSMA,  no  nodes  in  D  may  be  neighbors  of  each 
other.  Let  Q(D)  be  the  probability  that  at  an  insunt  of  doe,  all  nodes  in  D  are  busy,  and  ail 
nodes  not  in  0,  are  idle.  Then  each  set,  0,  represents  a  smte  in  a  Markov  system,  and  Q(D)  is 
the  state  probability.  In  perticular.  the  null  set  D  •*  p,  represents  the  state  that  all  nodes  are 
idle. 

Assume  the  system  is  in  state  O.  It  will  leave  the  sute  if  any  itD  stops 
tnuismitdng.  This  happens  with  rate  Thus  the  transidon  to  state  {D>i}  occurs  with  rate  n,. 
The  only  other  way  to  leave  state  D  is  for  one  of  the  idle  nodes  that  is  not  a  neighbor  of  any 
i*D  to  begin  to  transmic  This  occun  with  rate  ;y.  Let  be  the  set  of  ail  neighbors  of  ail 
nodes  in  D.  Then  the  transidon  from  D  to  {D+jK  yciV^,  occurs  with  rate  gj.  The  global 
balance  equadons  for  this  system  are 

(2  ^  +  S  -  2  *  +  2  >*;  P) 

j  ik  >*0  t*o  f  ^  ''•o 

where  0  is  one  of  the  special  sets  defined  above. 

It  is  easy  to  see  that  these  equadons  are  satisfied  by 

C(D) -  — fl(O-i) -(JiCvfJ-i), 


(8) 
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C(Z))-(n  G,)  Q(4) 

iU> 


where  we  adopt  the  convendon  that  n  (7(  ^  1.  Summing  over  all  0,  we  get 

t** 


2  Q(D)  -  2  1 


la  the  previous  seedon  we  found  we  were  interested  in  quanddes  like  P(A),  where 
A  is  any  set  of  nodes,  and  P(A)  is  the  probability  that  all  nodes  in  A  are  idle,  and  all  nodes  not 
in  A  may  or  may  not  be  idle.  This  can  be  found  by  saraating  Q<D)  over  all  sets  D  that  do  not 
contain  nodes  in  A.  Thus 


2 

OCA* 

2  (n 


where  D  C.A*  refers  to  all  such  sets  contained  in  the  complement  of  A  and  N  is  the  set  of  ail 
nodes.  We  adopt  the  shorthand  noudon. 


where  SP  refers  to  sum  of  products.  Thus 


P{A)  -5f»(^‘^)/5P(iV) 


Equations  (2),  (6),  and  (11)  can  be  used  for  any  topology  to  generate  the  solution 
to  our  problem.  The  equations  relating  the  Jr.-,.  (7,,^.  and  Gj  can  be  solved  iteratively.  For 
example,  equation  (5)  now  becomes 
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S,j,, 

"  sP{y) 


(U) 


where  by  A+B  we  meen  the  union  of  A  and  B. 

Eveluation  of  sums  of  products  in  equation  (12)  are  nude  easier  by  the  following 
two  roles.  Consider  two  sets  of  nodes  A  and  B  such  that  no  node  in  A  can  hear  any  node  of  B. 
Then 


5P(.4+B) -5f*(/l)SP(B). /I  (15) 


Also, 


SP(A)  -57(v4-i)  -r  G,SP(A-y,),  itA  (16) 

To  prove  these  rules  just  consider  all  producu.  We  have  successfully  evaluated  many  complex 
topologies  with  these  procedures. 

There  are  other  relations  which  will  be  found  useful  in  extending  cur  model  to  more 
complex  situations.  We  prove  some  of  these  below.  Let  C  be  a  cut,  i.e.,  a  set  of  nodes  that 
divides  the  network  into  three  pans  B,  and  C,  where  A  and  B  have  no  neighbors  in  common 
as  in  equation  (13).  Let .4j,  B  ■■  Si -I- Bj  where “  iJ*-  Then 

I/-  ,  ,  f»(^i.C.J,)  _  SP(At-h3:)  _  SP^AQ 

(  ,1C.5,)  p^cj,)  SPiA-rBi)  SP[A) 


But 


^  P{C)  ""  SP{A^3)  "  SP(A) 


Thus 
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?(.4,lC.Ji)  C  a  aa 


(17) 


we  also  have 


P{AxBx\C)  •  P{Ax\C)F{ax\C),  C  a  aa  (18) 

In  pardcniar  if  C  ™  .Vj",  then 

PUW^)^BU\y!),BCyr  (19) 

V.  A  GENUAL  CLASS  OF  PACEET  LENGTH  OISTEIBITTIONS 

In  this  section  we  extend  the  resaits  just  proven  to  include  a  general  class  of 
distribudons  for  the  packet  length.  We  will  show  that  the  pnxedures  developed  for  perfect 
capture  are  independent  of  packet  length  distzibuuon.  To  prove  this  we  start  with  a  simpler 
extension.  In  the  above  we  assumed  that  all  packet  lengths  are  exponentially  distributed  and 
have  the  same  mean  when  transmitted  by  a  node  to  any  of  its  neighbors.  Did'erent  nodes  may 
transmit  different  average  length  packets,  however.  Now  assume  that  while  ail  packet  lengths 
are  still  exponentially  distributed,  the  average  length  packet  transmitted  from  a  node  may  be 
different  to  each  of  iu  neighbors.  This  will  be  useful  in  anaiyang  different  protocols  (to  be 
presented  in  a  ;*euel  paper)  but  is  presented  here  as  the  first  step  in  the  desired  extension. 

Now  the  state  of  the  network  depends  upon  who  is  transmitting  and  to  whom.  We 
can  keep  the  same  structure  by  breaking  every  node  into  a  set  of  ’micronodes',  one  for  each 
neighbor.  These  nodes  may  be  indexed  by  (/.nt).  If  i  is  transmitting  to  n,  then  this  node  is 
active,  otherwise  it  is  idle.  Micronodes  are  connected  in  our  topology  if  they  can  hear  each 
other.  Since  CSMA  still  prevails,  all  micronodes  for  a  given  node  are  fully  connected. 
Furthermore  all  micronodes  for  nodes  that  are  connected  in  the  original  topology  are  also  fully 
connected.  The  analysis  now  proceeds  as  above  since  the  Markovian  property  has  been 
maintained.  For  example,  equation  (14)  still  bolds,  but  now  .V,  and  are  collections  of 
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tnicronodes.  Note  that  S,  contains  the  fuU  set  of  micronodes  for  i  and  ail  neighbors  of  i. 
and  Gi^  have  the  same  meaning  as  before.  However  they  are  normalized  by  1/M(^,  the 
average  packet  length  for  packets  going  from  i  to  /((.  The  terms  in  the  sums  of  products  are 
for  the  micronodes. 

Let  A  contain  full  sets  of  micronodes  and  include  the  node  i.  Then  from  equation 

(16) 


SP(A)-SPlA-(i,ni)]^Gt^^SJ>iA-N,)  (20) 

Here  we  have  used  the  fact  that  and  the  notadon  that  y,  is  the  set  of  all 

micronodes  of  i  and  neighbors  of  i.  Since  this  is  equivalent  to  the  original  set  of  node 
neighbors  of  i  we  keep  the  same  noudon.  Repeadng  for  all  neighbors  of  i  we  get, 

5?(^)  •SPiA-i)  +  (2  Gi^)SP{A~y,)  (21) 


If  we  let 


-  —  -  2  -  2  (22) 

then  equadon  (16)  is  preserved.  In  a  similar  manner  all  previously  derived  equations  can  be 
maintained  where  Gi  takes  on  the  definidon  in  equadon  (22).  Here  l/u^  is  an  average  packet 
length,  averaged  over  the  different  average  packet  lengths  to  different  neighbors  in  propordon 
to  their  scheduled  rate. 

We  now  prove  our  main  theorem  for  this  section.  Assume  that  the  length  of 


packets  transmitted  from  i  to  n,  has  the  density 
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2 

j 


i 


(23) 


where 


a,^j  2  0  aad  2  “  1 

j 

Thus  the  length  is  distributed  as  a  posidve  sum  of  exponentials,  .\nother  way  of  looking  at  this 
is  that  the  a’s  are  the  probabilides  of  choosing  the  associated  exponential  density.  Now  create 
oicronodes  for  each  triple  Here  we  use 

The  micronodes  for  some  i  and  any  are 

fully  connected  as  are  the  micronodes  for  neighboring  nodes.  Equadons  (20)  and  (21)  now 
became 


and 


S?(.4)  -  (z./i,  J)1  +  G,^jSP{A-S,) 

SP{A)  ^SP{A-i)  +  (2  (J,^j}s?(.^-iVi) 


(24) 

(25) 


In  the  same  manner  as  above,  we  let 


2 


Z 


T 

J 
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These  eqiudons  are  used  to  Snd  P(A)  in  terms  of  Gi  and  are  idendcal  in  form  to  those  derived 
for  exponentially  distributed  packet  lengths. 


Equadon  (5)  in  turn  comes  from 
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Summing  over  j,  we  get  equation  (2) 


Thu*  all  relations  between  the  G's  and  the  5*$  are  preserved.  The  aaual  nature  of  the  problem 
is  taken  into  account  by  the  relationship  between  the  S*s  and  the  normaliation  by  the  m’s- 

We  can  now  restate  the  above  theorem.  Let  i  be  any  node  and  J*N‘,  the 
neighborhood  of  L  Let  r^j  be  the  successful  (desired)  nte  from  i  to  j,  in  packets/sec.  Let  the 
density  of  the  packet  lengths  be 


-  S  **’■'■*  nijj,  >  0.  0. 

k 

2  •  1- 


Let  1/m/j  “  2  average  packet  length,  in  seconds.  Let  5,^  -■  Sf^/n.j.  Then 

k 


-  ?(.V,UiV,)  -  a  funcion  of  (G,.(7:,  •  •  •  ) 


where 


"  2 


I**  ^IJJL  * 


Then  Sijj,  -  a,jj,s,j  -  f»(.V,  UiV^)g,^jk 
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when  Gi  -  2  “  2  2 

y«V,*  * 

Dividing  Sfjj^  by  and  summing  over  k,  we  get 

VL  EXAMPLES  OF  THE  PROCEDUaE 

As  an  example  consider  tbe  chain  of  four  nodes  shown  in  Figure  2.  We  assume 
•^11  “ -^u  “•S’a  -Jn  ■■  ■S’34  ™ -Jo  ™ for  simplicity,  and  perfea  capture.  Also  note  that 
^la  ■■  Cl,  <743  (74  and  by  symmetry  (7i  ”  <74.  <72j  <734.  and  Gn^Gyz-  Also  from 

equation  (6),  (7j  ™  (7jj  +  Gi-  From  equation  (5),  we  have 

-  F (1.2,3)  -  -  ?( 1.2.3. 4) 

Oi  (/jj  C/23 

But 

-  2  (n^()  ’-\^Gx-Gz^G^^G^^G^G-iT  GzG^^G^G^ 

itO 

-1  +  :(7,  +  2(7i>2(7,<7:^<7,2  -  A 


and 


F(I.2,3.4)  -  1/A,  F(1.2,3)  -  -  (1^(7, )/A 


Solving,  we  get 


Gj  -  Gi(2-K7,)  and  S  -  CiCl-i-GO/A  . 


or 


S  -  (7,(l-K7,)/(l-H5(7,+7G,-+2(7,’). 
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We  can  sow  and  the  maximum  value  of  S  possible,  the  throughput  of  the  chain,  which  is  .123. 
obtained  when  G  i  “  0.71. 

In  general  the  equadons  cannot  be  solved  as  simply  as  for  this  four  node  example. 
Equadon  (11)  is  used  to  get  expressions  for  P(.A)  in  terms  of  <7f.  Then  equadon  (14)  is  used 
to  get  expressions  for  Stj^  in  terms  of  Gtj^  and  P(A).  Equadon  (6)  provides  the  reladon 
between  G,  and  The  Si^  are  found  from  the  offered  uaffic,  the  roudng,  and  other 

assompdons  and  are  considered  as  inpuu.  The  equadons  are  iterated  until  a  soludon  of  G's  for 
a  set  of  S^s  is  found.  The  maximum  set  of  S's  possible  is  considered  the  throughput,  or 
capacity,  of  the  network.  For  some  modest  size  problems,  as  above,  the  equadons  can  be 
solved  directly. 

.As  a  second  example  consider  the  star  topology  shown  in  Figure  S.  Here  assume 
there  are  L  tegs  of  N  *  2  nodes  each.  Denote  the  center  node  by  0,  the  nodes  one  hop  out  by 
1,  and  the  other  nodes  by  2.  Further  assume  symmetrical  trafSc  in  the  nodes  and 
•^01  “  -^lo  “"Ju  S.  Then  LS  is  the  total  traffic  successfully  transmitted  by  node  0.  The 

equadons  are 


LS 

Go 


■12,  —  P[aJl  bui  L—\  ilnodts) 

</io  G\o 


SP{L~\  i  2  nodes) 
SPUV) 


-(!+<;  i)^"‘/.i 


P(0,1.2) 


5PfL-l  /err) 
SPiN) 


(1^(7, 


where 


h-5P(.V)  ^Cod^C?:)^ 

<7i  -  (7 10  <7,2 


But  <7,0  "  G^L  and  5,2  “  G2,  so  C,  *  <72  +  G(JL.  Thus  we  have  two  equations: 
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LS/Co  -  (1-K?2)^"'/A  and  5/Gi  -  ( l-2C:-ryo/^  )^*‘A 

For  any  5  <  S^,  they  can  be  solved  for  Go  and  G:-  Alternatively  for  any  Gq  we  can  Snd  the 
corresponding  G;  by  solving 


Cad+G:)^**  -G^l  '-2G:+G»^t)‘-'  . 

Then  the  relatiaa  between  S  and  Gj  can  be  studied.  We  then  find  the  maximum  S,  S _ 

possible.  LSau.  >>  the  maximum  throughput  of  the  star. 

For  larger  problems  we  will  get  several  eqtudons  of  the  form 

Go-i:^Ml-K?2)^"‘ 

Gj  -  5Ml-!-2G2+G(/i)^*' 

For  any  S  we  solve  these  iteradveiy.  Since  Go  2:  LS,  G|  ^  25,  and  G:  ^  5,  the  lower 
bounds  are  good  starting  points  for  G(.  For  S  sufficiently  less  than  5au  have  found  the 
iteration  converges  monotonicaily  and  rapidly.  As  S  approaches  from  below  the 

convergence  is  soil  monotonic  but  slows  appreciably.  For  5  >  the  iteration  does  not 
converge  and  often  diverges  dramatically.  We  have  uncovered  no  serious  numerical  problems 
with  this  procedure  in  the  many  examples  we  have  evaluated. 

VTI.  NUMEaiCAL  EXAMPLES 

We  consider  here  three  different  topological  structures  with  exponentially  distributed 
packet  lengths  and  penect  capture.  We  assume  all  S,  for  all  i,  and  uke  full  advanuge  of 
symmetry.  The  three  topologies,  shown  in  Figures  3,  4,  and  5,  are  a  chain,  a  ring,  and  a  star, 
ail  with  various  lengths.  In  each  case  we  find  the  maximum  throughput,  S.  These  are  given  in 
Table  I. 

The  maximum  one  way  throughput  for  a  long  chain  is  S  *■  .086.  This  throughput  is 
approached  when  the  lengths  of  chains  exceed  10.  For  smaller  length  chains  the  throughput  is 
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higher.  In  CSMA  transmisiions  of  neighbors  may  .not  overlap  in  time.  Since  each  nodes 
cransmiu  successfully  2S  packets  per  average  packet  transmission  time,  then  we  must  have  S  2 
1/5.  The  throughput  is  slightly  than  half  of  this  limiL  The  cause  of  the  reduction  is 

collisions  from  transmissions  two  hops  away,  the  sonalled  ’hidden  terminals*.  The  throughput 
of  S  «  .086  for  a  chain,  although  the  maximum  possible,  is  not  a  useful  operating  point.  As  in 
ALOHA,  this  is  the  point  at  which  delays  become  infinite  and  the  system  is  unsuble.  The 
network  would  have  to  be  operated  at  some  lower  level 

It  is  instructive  to  compare  the  performance  of  multihop  CSMA  with  that  of  slotted 
ALOHA.  Let  p  be  the  probability  of  transmission  in  one  direction  at  a  node.  Then  S  *  p(l- 
2p)^  for  a  long  chain.  5^  here  is  .074  which  is  approximately  14%  less  than  that  for  CSMA. 
There  are  two  factors  working  here.  CSMA  will  produce  less  collisions  since  neighbors  will  not 
interfere  with  each  other.  Hidden  terminals  will  still  produce  collisions.  (All  terminals  are 
hidden  in  ALOHA).  But  CSMA  prohibits  possible  successful  tmnsmissions.  For  instance, 
node  3  can  transmit  successfully  to  node  2  while  node  4  is  transmitting  successfully  to  node  5. 
This  is  possible  in  ALOHA  but  prohibited  in  CSMA.  This  is  one  of  the  prices  paid  to  control 
collisions. 

We  note  that  for  a  ring  greater  than  7  nodes  the  maximum  throughput  is  the  same 
as  that  for  a  long  chain.  This  is  expected  since  the  congestion  is  now  in  the  middle  and  it  is 
unimportant  whether  or  not  the  chain  is  closed.  A  star  with  two  legs  is  just  a  chain  with  N  for 
the  star  replaced  by  2N+1  for  the  chain. 

Consider  the  star  configuration  as  representing  the  center  node  (0)  trying  to  transmit 
to  some  node  or  nodes  far  away  via  many  repeaters.  For  one  leg,  the  maximum  rate  is  .086. 
For  two  legs,  the  maximum  rate  is  2S  or  .172,  exactly  twice.  The  results  are  shown  in  Table  11 
where  the  throughput  of  the  center  node  is  given  by  LS.  We  see  from  Table  II  that  whereas 
the  throughput  doubles  for  L  2.  it  increases  only  by  20%  when  L*3.  by  4-5%  further  when 
L*4,  and  by  2%  when  L*5.  Congestion  at  the  central  node  is  limiting  its  ability  to  increase  its 
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throughput.  Additional  legs,  beyond  three,  are  not  really  helpful. 

We  have  investigated  ways  of  reducing  the  congestion  at  the  center  node.  For  larger 
L.  the  traffic  in  each  leg  is  limited  by  congestion  at  the  center.  The  collisions  that  cause  most 
problems  are  for  transmissions  from  the  first  level  of  nodes  (1,  N+1,  2N+1,...,  (L-l)N'*'l)  to 
the  center.  These  are  collided  with  by  other  first  level  nodes.  To  reduce  these  collisions  we 
considered  connecting  the  first  level  nodes  in  a  ring  and  then  fully  connecdng  them.  These 
results  are  also  summarized  in  Table  L  When  the  first  level  is  unconnected  the  throughput 
saturates  at  .229.  When  the  first  level  is  fuOy  connected  the  throughput  with  9  legs  is  .252,  a 
15%  increase.  For  four  legs,  the  ring  connected  topology  is  best,  providing  some  compromise 
between  reducing  collisions  and  allowing  simultaneous  transmissions. 

The  best  that  we  can  expect  in  the  fully  conneaed  case  is  LS  1/3.  This  is 
because  all  transmissions  from  the  center  node  and  all  first  level  nodes  cannot  overlap.  We  will 
discuss  asymptotic  results  with  even  larger  stars  and  chains  in  the  next  section. 

Vin.  ASYMPTOTIC  RESULTS 

We  are  interested  in  asymptotic  results  for  several  reasons.  They  provide  us  with 
the  limiting  behavior  of  the  finite  networks  previously  studied.  Since  the  behavior  of  these 
networks  seems  to  converge  rapidly  with  their  size,  if  asymptotic  results  are  easier  to  obtain, 
they  would  be  useful.  We  are  also  interested  in  very  large  networks.  A  final  reason  is  to  verify 
some  of  the  bounding  arguments  on  throughput  made  in  the  last  section. 

We  first  consider  an  infinitely  long  chain.  We  let  ■■  5.  Then  all  nodes 

are  identical.  Also  (/(j+i  ■■  Ou-i  *  <J(/2.  Thus  with  (7(  *  (7, 


2^ 

G, 


5F(— oo /— ao) 

SP(—so . oo) 


(:9) 


We  can  write  the  denominator  as 


1 


-  19- 


iIP(— oo,...,ao)“S?(— 30,  •  •  •  j  — l)5^({-*-I . co)-HjiSP{-r30,...J--2)SP(i-^2,. 


converges  is  independent  of  L  Then  equation  (29)  becomes 


25  ,  I 
G  QiQi^Qi 

But 


n  a  G  SP(  — co...».i-ri(;— 2) 


Qi-l 


andQo"*!.  Thus  Qj  *  fii  +  G.  Therefore 


2S/G 


1 

e,(c,  +  2(7) 


or 


5 


<7 

2Q,(Q,^2G) 


We  note  that  since  ^.i 


from  equation  (31)  we  have 
Q\ 


or 

-Ct(Cr'i) 


Finally  we  have 


5 


g.-i 

2g,(2g,-l) 


.2i  a  I 


..,00). 

for  k  ^0  and  if  it 

(30) 

(31) 


(32) 

(33) 

(34) 


The  maximum  value  of  S  is  .086,  reached  when  Qi  *  1.7  or  G  *  1.2. 
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IX.  EXTENSIONS  AND  CONCLUSIONS 

We  have  presented  a  ample  but  fairly  realistic  model  of  a  muitibop  packet  ndio 
network  and  have  obtained  maximum  throughputs  for  general  topologies  and  packet  lengths. 
We  have  assumed  perfect  reception  of  acknowledgments  and  have  not  included  additional  traffic 
due  to  end-to-end  acknowledgmeats.  Some  aspecu  of  acknowiedgmenu  can  be  included  by 
increasing  the  required  traffic.  We  are  investigating  the  effiect  of  imperfect  acknowledgments 
and  different  retransmission  strategies.  The  model  should  still  be  useful  under  these 
extensions. 


REFERENCES 


[1]  L  Kldnrock  and  F.  Tobagi,  ’Packet  switching  in  radio  channels,*  parts  I  and  H.  I£££ 
Trans.  Commun.,  vol.  COM-23,  pp.  1400*1433,  December  197S. 

[2]  F.  Tobagt,  ’Analysis  of  a  Two-Hop  Centralized  Packet  Radio  Network*,  Part  II:  CSMA 
IEEE  Trans.  Commun.,  COM-2S,  pp.  208-216,  February  1980. 

l3l  R-  E.  Kahn,  The  organization  of  computer  resources  into  a  packet  radio  network,*  IEEE 
Trans.  Commumeadons,  Vol.  COM-25,  pp.  169-178,  January  1977. 

[4]  L  Gitman,  R.  Van  Syke,  and  H.  Frank,  ’Routing  in  packet-switching  broadcast  radio 
networks,’  EEEE  Trans.  Communications,  vol.  COM-24,  pp.  926-930,  September  1976. 

[5]  R.  Boorstyn  and  A.  Kershenbaura,  *Throughput  Analysis  of  Multi-hop  Packet  Radio 
Networks,’  IEEE-ICC*80,  pp.  13.6.1-13.6.6,  Seattle,  WA.,  June  1980. 

[6]  V.  Sahin,  ’Analysis  of  Muldhop  Packet  Radio  Networks',  Ph.D  Thesis,  Polytechnic 
Institute  of  New  York,  June  1982. 


Tahiti 


Muimum  One-Way  Throughput  (S) 


Number  Star 

of  Number  of  Legs,  L 


Nodes,  N 

Chain 

Ring 

L-2 

L-3 

L-4 

L-5 

1 

1.000 

1.000 

.167 

.103 

.074 

.058 

2 

.300 

.300 

.111 

.076 

.057 

3 

.167 

.167 

.097 

.072 

.053 

4 

.128 

.073 

.092 

.070 

.054 

5 

.111 

.100 

.069 

.054 

.044 

6 

.102 

.083 

7 

.097 

.087 

8 

.094 

.083 

9 

.092 

.086 

10 

.091 

.086 

oo 

.086 

.086 

.086 

.069 

.034 

.044 

Tabtt  11 


Maximum  One* Way  Total  Throughput  of  the 
Central  Node  in  a  Large  Star  Arrangement 


Maximum  Throughput,  LS 
Center  Arrangement 

Sumbtr  of  Lift  Uttconntetd  JUnf-Conneaid  Fully  Conructed 


1 

.086 

2 

.172 

3 

.207 

.198 

.198 

4 

.216 

.228 

.216 

5 

.220 

.230 

.230 

6 

.220 

.240 

1 

.245 

8 

.248 

9 


.252 


/ 

V  01L-4)N<»2 

/ 


O  . 

^H*z  .  • 


Rq«r«  1.  A  pert  at  a  n«r»orH 


Fiflur*  i  A  3forn«t\iorl«  (L  I«qa) 


Ffqura  2.  A  four  noOo  eftoi" 


2  ^  X 

I 

Fiquro  4.  A  rfnq 


12  3 

Ftqufo  3.  A  eftoin 


APPENDIX  D 


To  appear  in  Networks  Journal. 


Generaiized  Augmenting  P-.tns  for  the  Solution  of  Combinatorial 

Gptimizaiicn  Problems 
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Abstract 

Alternating  chain  procedures  can  be  thought  of  as  generalizations  of  the  greedy 
algorithm  in  that  instead  of  accepting  the  best  remaining  element,  they  seek  to 
obtain  a  better  augmentation  by  examining  a  wider  range  of  alternatives.  It  is 
possible  to  generalize  the  notion  of  an  augmenting  sequence  to  include  augmentations 
which  are  in  effect  trees  as  opposed  to  simply  paths  such  that  these  augmentations 
are  sufficient  to  guarantee  optimality.  Unfortunately,  in  the  worst  case,  these 
trees  are  of  exponential  size.  We  examine  the  application  of  such  generalized 
augmenting  sequences  to  the  solution  of  NP-complete  problems  and  examine  their 
effectiveness  and  efficiency. 

Introduction 

The  theory  of  NP-completeness,  which  was  first  expounded  by  Cook  [1],  has  led 
to  a  search  for  a  unified  treatment  of  combinatorial  optimization  problems.  Cook, 
was  able  to  characterize  a  very  large  class  of  interesting  and  important  problems 
as  being  equivalent  in  the  sense  that  an  efficient  algorithm  capable  of  finding  an 
optimal  solution  to  any  one  of  these  problems  can  be  used  to  obtain  optimal  solu¬ 
tions  to  all  of  the  others.  .Many  papers  by  many  authors  and  an  excellent  com¬ 
pendium  [2]  of  problems  in  this  class  (as  well  as  techniques  for  proving  that  a 
problem  is  in  this  class)  have  been  published  since  Cook's  seminal  paper.  Prob¬ 
lems  in  this  class  are  called  NP-complete  problems  (or,  more  properly,  NP-hard  when 
they  are  optimization  problems  as  opposed  to  decision  problems). 

Cook's  results  can  be  interpreted  in  several  ways.  One  of  these  is  to  say  that 
many  clever  people  have  spent  many  years  trying  and  failing  to  find  efficient 
algorithms  for  individual  problems  in  this  class.  Surely  one  of  them  would  have 
succeeded  if,  in  fact,  such  algorithms  existe^.  Hence,  it  is  unlikely  that  such  an 
algorithjn  will  be  found  and  it  is  tempting  to  stop  looking  for  one.  This  leads  to 
the  development  of  heuristics  for  the  solution  of  such  problems  [3]  and  to  proba¬ 
bilistic  methods  (4). 

An  alternate  interpretation  is  that  this  pessimistic  view  is  justified  only  with  re¬ 
spect  to  algorithms  which  guarantee  optimal  solutions  and  reasonable  runtimes  for 
all  instances  (input  data  sets)  of  a  problem.  In  this  paper  we  speak  of  an  algo¬ 
rithm's  ru.'.time  teing  reasonable  if  it  grows  polyncmially  rather  than  exponentially 
with  the  size  of  ine  problem.  This  does  not  preclude  the  existence  of  algorithms 
wiLh  guaranteed  reasonable  runtimes  and  which  yield  optimal  (or  near-optimal) 
solutions  with  high  probability.  Nor  does  it  preclude  the  existence  of  algorithms 


which  guarantee  optimal  solutions  and  which  have  reasonable  runtimes  with  high 
probability.  There  are  many  examples  of  both  types  of  algorithms  which  are  used 
in  practice  to  solve  specific  NP-complete  problems.  Most  important,  the  theory  of 
NP-completeness  does  not  preclude  or  even  lessen  the  likelihood  of  the  existence  of 
algorithms  which  solve  specific  (nontrivial)  instances  of  a  problem  and  guarantee 
both  an  optimal  solution  and  reasonable  runtime. 

9 

In  this  paper,  we  explore  this  second,  more  optimistic,  point  of  view  and  present 
a  family  of  algorithms  for  the  solution  of  an  NP-complete  problem.  Some  algorithms 
in  this  family  have  guaranteed  reasonable  runtimes.  Others  guarantee  optimal 
solutions.  While  the  algorithms  are  presented  for  the  solution  of  a  specific  prob¬ 
lem.  the  technique  can  be  extended  to  the  solution  of  other  problems  as  well. 


Matroid  Theory 

A  specific  way  of  approaching  the  solution  of  many  combinatorial  optimization 
problems  is  via  matroid  theory.  The  excellent  book  by  Lawler  [5]  gives  a  complete 
treatment  of  this.  Here  we  outline  the  fundamentals  of  this  theory  which  are 
necessary  for  the  presentation  which  follows. 

A  matroid  is  a  couple  (E,F)  where  E  is  a  finite  set  of  m  ele.ments: 

E  =  {Cj  1  i  =  X.2.  ...  M} 

and  F  is  a  family  of  independent  subsets  of  E.  The  notion  of  independence  is 
quite  general.  We  require,  however,  that  it  satisfy  two  properties: 

PI:  Every  subset  of  an  independent  set  is  independent,  i.e.,  if 
I  c  F  and  J  Q_  I  then  J  e  F 

P2:  If  Ip  and  Ip^j^  are  independent  subsets  of  E  containing  P  and  P  +  1 
elements,  respectively,  then  there  exists  an  element,  e  e  Ip^j^  (e  i  Ip) 
such  that  Ip  '^{e}  is  an  independent  set  containing  P  +  1  elements. 

Given  two  matroids,  (E,Fi)  and  (E.Fj),  defined  on  the  same  set  of  elements,  but 
using  two  different  notions  of  independence,  ^ we  define  an  intersection  of  them  to 
be  any  subset,  1(2  E,  such  that  I  e  Fi,  and  I  c  Fj.  This  definition  can  be  ex¬ 
tended  to  cover  three  or  more  matroids  as  well. 

.Many  combinatorial  optimization  problems  can  be  thought  of  as  finding  the  best 
independent  set  in  a  matroid  or  the  best  intersection  of  two  or  more  matroids.  If 
weights,  w. ,  are  associated  with  the  elements,  Cj,  in  E,  then  one  can  speak  of  the 
best  set  as  being  the  one  with  largest  total  weight.  The  maximal  (or  minimal) 
spanning  tree  problem  can  be  thought  of  as  finding  the  maximum  (or  minimum) 
weight  independent  set  in  a  matroid  (E,F)  where  E  is  the  set  of  edges  in  the 
graph  and  F  is  the  family  of  forests.  A  forest  is  defined  to  be  a  set  of  0  or  more 


edges  which  do  not  contain  a  circuit.  As  another  example,  Lawler  [5,  p.  304] 
shows  that  the  Traveling  Salesman  Problem  can  be  thought  of  as  finding  the  best 
intersection  of  three  matroids.  The  problem  of  finding  the  maximum  weight  inter¬ 
section  of  three  matrcids  has  been  shown  to  be  NP-complete  [2].  Lawler  snows  [5, 
p.  364]  that  the  problem  of  finding  intersections  of  four  or  more  matroids  can  be 
reduced  to  that  of  finding  intersections  of  three.  There  are  many  other  combina¬ 
torial  optimization  problems*  which  can  be  naturally  thought  of  as  matroid  intersec¬ 
tion  problems.  The  theory  of  NP-completeness  assures  us  that  all  problems  can  be 
thought  if  in  this  way. 

We  will  consider  one  of  the  simplest  possible  3-Matroid  Intersection  Problems  in  the 
sequel  for  the  sake  of  clarity.  The  problem  considered  is  the  Three  Dimensional 
Assignment  Problem  (TDAP).  In  this  problem,  we  are  given  N  people,  N  jobs, 
and  N  days.  There  is  a  cost,  C.jj^  of  having  person  i  doing  job  j  on  day  k.  Each 
person  is  to  do  only  one  job,  each  job  is  to  be  done  only  once,  and  only  one  job 
is  to  be  done  on  a  day.  Formally  the  problem  is: 

Mlmmize  2  =  i 

1  ^ 

such  that 


•’'ijk  ^ 


Thus,  setting  to  1  corresponds  to  having  person  i  do  job  j  on  day  k.  This 
problem  can  be  viewed  as  an  intersection  of  three  partition  matroids.  Given  a  set 
of  elements,  E  (in  this  case,  the  a  partition  matroid  can  be  defined  by  a 

partition  of  E  and  a  vector.  A,  constraining  the  number  of  elements  of  E  which 
may  be  selected  from  any  part  of  the  partition.  Formally,  we  have  the  partition  of 
E  into  subsets  E^,  j  =  1,  ...  k,  where 


U  E.  =  E 
j 


E.  ^Ej  =  (^  for  i  j 


and  an  integer  vector  A  =  {ajl  j  =  1,  ...  k] 

A  matroid  (E,F)  is  then  defined  where  F  consists  of  all  subsets,  I,  of  E  formed  by 
selecting  no  more  than  a.  elements  of  Ej. 

In  the  case  of  the  TDAP,  the  first  partition  of  the  is  by  person,  i.e., 

Ei  =  {X.jj^lj  =  1,  ...  N;  k  =  1,  ...  N] 


and  3.  =  1  for  all  i.  The  independent  sets  in  this  first  matroid  correspond  to 
assigning  each  person  at  most  one  job.  Similarly,  two  more  matroids  can  be  de¬ 
fined  to  constrain  jobs  and  days.  Intersections  of  these  three  matroids  correspond 
to  feasible  partial  assignments  and  intersections  of  maximum  cardinality  correspond 


to  feasible  compiete  assignments.  If  we  define  weights  W..j^  associated  with  th 


w...  =  c  -  c.., 

ijk  ijk 


where  C  is  larger  Lhan  any  then  the  maximum  weight  intersection  corre¬ 

sponds  to  the  optimal  solution' to  the  TDAP. 


Augmenting  Paths 

We  now  define  a  family  of  algorithms  for  the  solution  of  matroid  intersection  prob¬ 
lems.  These  are  generalizations  of  the  basic  procedure  given  in  [6]. 

Given  a  matroid  (E.F)  (and  hence  a  notion  of  independence)  and  a  (not  necessarily 
independent)  subset  S,  of  E.  we  define  the  span  of  S,  denoted  sp(S),  as  S  to¬ 
gether  with  all  elements  of  E  not  independent  of  the  elements  in  S,  that  is 

sp(S)  =  {  e  I  I  o  {e}  ^  F  where  I  is  any  independent  subset  of  S} 

If  S  is  an  independent  set  and  e  e  sp(S)  then  e  forms  a  unique  cycle,  which  we 
denote  by  C(e),  with  S.  A  cycle  is  a  dependent  set  which  becomes  independent  if 
any  element  is  removed  from  it. 

• 

If  the  matroid  intersection  problem  only  involves  two  matroids,  we  can  obtain 

(K) 

a  maximum  weight  intersection  by  producing  a  sequence  of  intersections,  r  \ 
containing  K  elements,  for  K  =  1,2,  ...  m.  Each  is  the  maximum  weight 

fV\ 

intersection  containing  K  elements.  The  algorithm  which  produces  the  P'  *  is 

f  K  ^  ( K  + 

called  an  augmenting  path  procedure  because  it  augments  P  'to  produce  P  ' 
’^7  finding  the  longest  path  in  the  graph  defined  below. 

We  define  G^^^  to  be  a  bipartite  graph  with  nodes  corresponding  to  the  elements, 
e.,  of  E  plus  distinguished  start  and  finish  nodes,  a  and  z.  Directed  arcs  are 
defined  as  follows: 


(a,i) 

i  e  E  -  spxd^*'^^) 

(i.j) 

i  e  E  -  j  e  C^^\i) 

(i,z) 

i  e  E  -  spad^^^) 

/ 

(j,i) 

i  e  E  -  j  c  C^^\i) 

Paths  from  a  to  z  correspond  to  augmentations  of  that  is,  to  sets  of  elements 

to  be  added  or  deleted  from  P  'to  produce  an  intersection  with  K  1  elements. 

(K) 

Notice  that  all  a  to  z  paths  go  alternately  through  nodes  not  contained  in  P  ' 
(which  are  to  be  added  to  I^^^)  and  nodes  in  (which  are  to  be  deleted). 

Note  also  that  there  is  one  more  node  of  the  former  type  than  there  is  of  the 
latter  and  hence  an  augmentation  results.  If  we  associate  lengths  with  the  arcs 
equal  to  the  weights  of  the  elements  which  the  nodes  correspond  to  (positive  for 
elements  to  be  added  and  negative  for  elements  to  be  deleted),  then  the  length  of 
a  path  corresponds  to  the  incremental  weight  of  the  augmentation.  The  longest 


path  results  in  an  optimal  augmentation.  Such  a  path  can  be  found  using  a  short- 

f  K) 

est  path  algorithm  suitably  modified  to  find  longest  paths.  G  '  contains  no 
positive  cycles  and  so  the  algorithm  converges. 

These  augmentations  do,  indeed,  result  in  intersections.  As  one  passes  through 
nodes  from  a  to  2  we  see  that  an  element  is  added  preserving  independence  in  the 
first  matroid  but  not  the  ^second.  An  element  is  then  deleted  restoring  indepen¬ 
dence  in  the  second  matroid  and  hence  the  intersection.  A  node  is  then  added 
which,  because  of  the  deleted  node,  maintains  independence  in  the  first  matroid. 
This  process  continues  until  the  added  element  maintains  independence  in  the 
second  matroid  as  well  as  the  first,  thus  completing  the  augmentation. 

As  an  example,  consider  a  two  dimensional  assignment  problem  (involving,  say, 

(2'i  . 

only  people  and  jobs.)  The  W^'s  for  this  problem  are  given  in  Figure  1,  P  is 
clearly  11,22,  i.e.,  person  1  assigned  to  job  1,  and  person  2  assigned  to  ^ 

is  shown  in  Figure  2.  The  arc  lengths  are  shown  as  are  the  lengths  of  the 

longest  paths  to  each  node  from  node  a.  The  longest  a  to  2  path  is  a, 11, 12, 22, 23, 2 

which  corresponds  to  deleting  11  and  22  from  the  intersection  and  adding  31,12, 

(3) 

and  23.  The  length  of  this  path,  7,  is  the  difference  between  the  weight  of  !'■  ^ 
and  A  complete  description  of  this  process  and  a  proof  of  its  validity  is 

given  in  [5]. 


Generalized  .^ugumentinq  Paths 

In  the  graph  shown  in  Figure  2,  one  can  obtain  an  optimal  augmentation  (i.e.,  one 
which  takes  us  from  an  optimal  assignment  of  K  ele.ments  to  an  optimal  assignment 
of  K  +  1)  because: 

1.  If  the  current  intersection  is  not  maximal  then  an  augmenting  path  exists. 

2.  The  labels  given  to  the  nodes  during  the  longest  path  algorithm  completely 
summarize  the  augmenting  paths. 

We  now  wish  to  generalize  the  notion  of  an  augment!:. 7  path,  and  hence  the  entire 
procedure,  to  the  problem  of  the  intersection  of  three  matroids.  One  way  of  doing 
this  is  to  "freeze"  one  of  the  matroids  a'nd  only  consider  alternating  sequences 
within  the  other  two.  In  this  case  the  first  node,  S;  in  an  augmenting  path  would 
be  independent  of  I^^^  in  two  of  the  three  matroids  (or  in  all  three,  in  which  case 
it  is  the  only  node  in  the  augmenting  path).  Say  Si  is  independent  of  I^^^  in  the 
first  and  third  matroids.  We  could  then  freeze  the  third  matroid  and  maintain  the 
same  span  within  the  third  matroid  throughout  the  augmenting  path.  Thus,  the 
deletion  of  Sj  for  i  even  reduces  this  span  and  the  addition  of  s.  for  i  odd  restores 
it.  We  thus  reduce  the  search  space  to  two  matroids  and  the  same  polynomial 
bounded  procedure  will  work.  Note  that,  alternatively,  we  could  have  considered 
the  first  matroid  frozen.  Indeed,  it  is  so  frozen  in  the  two  matroid  intersection 
algorithm.  Thus,  there  are  three  types  of  augmenting  paths,  one  for  each  matroid 


within  which  Si  is  dependent.  Unfortunately,  while  this  procedure  is  polynomial 
bounded,  it  does  not  guarantee  optimal  solutions  as  there  are  augmentations  which 
have  no  such  corresponding  argumenting  path. 

In  order  to  guarantee  that  all  augmentations  are  explored,  we  must  relax  the 
definition  of  an  augmenting  path  still  further  to  include  cases  where  independence 
is  not  necessarily  restored  by  the  deletion  of  s.  for  i  even.  Thus,  an  augmenting 
path  may  start  with  any  element,  Si,  which  is  independent  of  P  '  in  at  least  one 
of  the  matroids.  Unlike  the  procedure  given  for  two  matroids,  one  may  begin  with 
independence  in  any  matroid.  Consider  the  graph  shown  in  Figure  3  corresponding 
to  two  augmenting  paths.  Path  1  and  Path  2,  for  the  partial  assignment  111,222,333 
(i.e.,  person  1  to  job  1  on  day  1,  etc.)  in  a  TDAP.  These  paths  are  not  strictly 
comparable  in  that  they  exclude  different  elements  along  the  way.  Thus  in  Figure 
2.  when  node  22  is  labeled  using  the  path  a, 32, 22  it  is  equivalent  (in  terms  of  how 
the  path  can  continue,  not  necessarily  in  terms  of  the  numerical  value  of  the  label) 
to  being  labeled  using  the  path  a, 31, 11, 12, 22.  In  Figure  3,  however,  when  node 
111  is  labeled  using  the  paLh  a, 411, 111  it  is  different  from  labeling  111  using  the 
path  a,  154, 111  because  different  continuations  of  these  paths  are  possible.  Thus 
starting  with  a, 411, 111  we  can  continue  to  152  but  not  215  and,  conversely,  start¬ 
ing  with  a,  154, 111  we  can  continue  with  215  but  not  152.  Thus,  Path  1  and  Path 
2  are  not  comparable  in  terms  of  their  lengths  only. 

Such  paths  must  also  be  compared  in  terms  of  their  spans.  We  note  that  if  two 
paths  from  a  to  some  node  i  result  in  sets  having  identical  spans  then  the  same 
continuations  of  both  paths  are  possible.  (This  was  the  case  for  intersections  of 
rwo  matroids.)  Indeed,  it  is  possible  for  paths  to  have  slightly  different  spans 
and  still  have  the  same  set  of  possible  continuations.  In  particular,  if  the  only 
difference  in  the  intersections  of  the  spans  of  two  paths  are  nodes  outside  the 
intersection  of  the  spans  of  P  \  then  the  paths  are  comparable.  We  can  thus 
generalize  the  augmenting  path  procedure  to  consider  ail  undominated  a  to  z  paths 
where  one  path  dominates  another  only  if  it  has  the  same  continuations  and  a 
larger  length. 


The  notion  of  a  path  itself,  however,  must  be  generalized  as  well.  In  the  case  of 
3  matroids,  not  all  augmentations  correspond  to  paths.  We  see  an  example  of  this 
for  a  TDAP.  The  augmentation  [412,234,341,123]  -  [111,222,333]  does  not  cor¬ 
respond  to  any  path  in  the  conventional  sense.  It  is  possible  however,  to  extend 
the  augmenting  path  procedure  to  include  such  augmentations  by  extending  the 
notion  of  a  path. 

We  define  a  generalized  augmenting  path  with  respect  to  an  intersection  P  ^  to  be 


r(K) 


for  odd  i  and  Sj  e 


a  sequence  of  nodes  S  =  (Si,S2,  ...  s  )  where  s.  e  E  -  I 

r  '  for  even  i.  As  before,  r‘  '  +  s,  -  s,  +  S3  -  ...  +  s„  is  an  intersection. 

‘  m 


Also, 


the  even  Sj  are  deleted  in  order  to  remove  dependencies  created  by  the  inclusion 


of  the  odd  s..  Now,  however,  the  subsequences  +  Sj  -  S2  +  ...-Sj  for  even  j 
need  not  correspond  to  intersections. 

One  can  thus  guarantee  an  optimal  intersection  as  in  the  case  of  two  matroids.  The 
number  of  generalized  augmenting  paths  one  may  need  to  consider,  however,  may 
grow  exponentially  with  K.  In  practice,  however,  the  number  of  such  paths  can' 
be  controlled  at  the  expense  of  optimality.  First,  the  length  of  any  path,  (Si,  Sj, 

. . . ,  s.) ,  can  be  reduced  by  a  penalty  to  account  for  the  nodes  which  still  must  be 
deleted  to  restore  the  intersection.  In  the  case  of  arbitrary  matroids,  this  may  be 
complex  to  compute.  In  the  case  of  the  TDAP,  however,  where  3  partition 
matroids  are  involved,  and  all  cycles  contain  2  elements,  it  is  easily  computed. 

In  some  cases  the  above  may  keep  the  computations  reasonable.  In  others,  it  may 
be  necessary  to  reduce  the  number  of  paths  considered  by  relaxing  the  definition 
of  dominance.  This  will  also  result  in  a  heuristic  rather  than  an  optimal  solution. 
In  the  case  of  the  TDAP,  one  such  relaxation  is  to  ignore  differences  in  the  spans 
outside  the  intersection  of  the  span  of  This  is  motivated  by  the  fact  that 

we  consider  deleting  elements  in  I^^^  in  order  to  include  elements  blocked  by 
them. 

We  can  thus  consider  a  hierarchy  of  generalized  augmenting  path  procedures  with 
increasingly  stringent  dominance  criteria  and  increasing  runtime.  A  tradeoff 
between  optimality  and  runtime  is  then  available.  We  are  currently  investigating 
this  tradeoff  using  the  TDAP  as  an  example. 
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Abstract 

The  problem  considered  is  that  of  finding  an  optimal  (minimum 
cost)  design  for  a  centralized  processing  network  given  a  set  of 
locations,  traffic  magnitudes  between  these  locations,  and  a  single 
common  soturce  or  destination.  Several  heuristics,  which  are  efficient 
(in  terms  of  their  execution  time  and  memory  requirements  on  a 
digital  computer)  and  which  produce  seemingly  good  results,  have 
already  been  developed  and  are  currently  accepted  techniques.  Some 
work  has  also  been  done  on  finding  optimal  solutions  to  this  problem 
both  as  a  design  tool  and  as  a  means  of  verifying  the  effectiveness  of 
proposed  heuristics.  We  focus  in  this  latter  area.  Currently  known 
techniques  for  the  optimal  solution  of  this  problem  via  integer  pro¬ 
gramming  have  fallen  short  of  the  desired  objectives  as  they  require 
too  much  memory  and  ninning  time  to  be  able  to  treat  problems  of 
realistic  size  and  complexity.  We  develop  an  improved  technique 
which  is  capable  of  handing  more  realistic  problems. 
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INTRODUCTION  AND  PROBLEM  STATEMENT 

The  problem  considered  is  that  of  finding  an  optimal  (minimum 
cost)  design  for  a  centralized  telecommunication  network  given  a  set 
of  locations,  traffic  magnitudes  between  these  locations,  and  a  single 
common  source  or  destination.  The  vast  majority  of  telecommunication 
networks  currently  in  existence  are  of  this  type.  Thus,  this  problem 
has  been  much  studied  (2,3,4,5,8,9,12,16,24,28,31,32). 

Several  heuristics,  which  are  efficient  (in  terms  of  their  execu¬ 
tion  time  and  memory  reqmrements  on  a  digital  computer)  and  which 
produce  seemingly  good  results,  have  already  been  developed  and  are 
ctirrently  accepted  techniques.  Some  work  has  also  been  done  on 
finding  optimal  solutions  to  this  problem  as  a  means  of  verifying  the 
effectiveness  of  proposed  heuristics.  Currently  know  techniques  for 
the  optimal  solution  to  this  problem  via  integer  programming  have 
fallen  short  of  the  desired  objective  as  they  require  too  much  memory 
and  running  time  to  be  able  to  treat  problems  of  realistic  size  and 
complexity.  We  develop  an  improved  technique  which  is  capable  of 
handling  problems  of  realistic  size. 

xMore  formally,  the  problem  considered  here  is  that  of  finding  a 
minimum  spanning  tree  subject  to  one  or  more  constraints  which  in 
general  are  equivalent  to  demanding  that  the  sum  of  the  traffic  asso¬ 
ciated  with  the  nodes  in  any  subtree  must  not  exceed  some  predeter¬ 
mined  maximum. 

A  minimum  spanning  tree  is  a  loop-free  collection  of  arcs  joining 
a  set  of  nodes  such  that  the  sum  of  the  lengths  of  the  arcs  is  mini¬ 
mal.  In  the  case  of  a  communication  network,  these  collections  of 
arcs  are  called  multidrop  lines. 
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It  should  be  noted  that  this  constraint  form  is  quite  general  and 
encompasses  many  real-world  constraints  which  arise  in  the  design  of 
centralized  telecommunications  networks.  Thus,  for  example,  in 
addition  to  treating  the  obvious  constraint  imposed  by  line  capacity, 
it  is  possible  to  treat  a  restriction  on  the  number  of  terminals  on  a 
multidrop  line  by  associating  a  uniform  traffic  with  each  terminal. 
Also,  the  length  (cost)  functions  which  can  be  treated  are  quite 
general.  Any  function  which  is  not  a  function  of  the  tree  chosen  is 
permissable . 

Formally,  we  seek  to  solve  the  following  problem: 

Given 

1.  A  vertex  (node)  set  V  =  {v.|i=0,l, . . .  ,n}  representing  the 
terminal  locations  in  the  network.  Node  Vq  is  a  distin¬ 
guished  node  which  we  will  refer  to  as  the  center. 

2.  A  symmetric  function  giving  the  length  (cost)  dy  of  an  arc 
between  any  pair  of  locations. 

3.  A  constraint,  m,  on  the  number  of  nodes  which  may  share  a 
multidrop  line.  This  constraint  can  be  generalized  to  allow 
a  weight  or  traffic,  c.,  to  be  associated  with  each  node  and 
to  require  that  the  sum  of  the  weights  associated  with  the 


nodes  on  any  multidrop  line  not  exceed  m. 

f  h 

We  define  the  set  of  nodes  in  the  j  multidrop  line  to  be  Vj  and 

the  multidrop  line  itself  to  be  a  minimal  spanning  tree  T^  on  Vj  i/{v^]. 

Thus,  the  constraint  can  be  stated  in  terms  of  the  cardinality  of 

as  |V. I  <  m  V. .  In  the  more  general  form,  the  constraint  would  be 
11  * 

Z  C;  <  m  Vj.  We  wish  to  find  a  tree,  T..  of  minimum  total 
Vi  e  Vj  ■  -  ) 

length  satisfying  the  constraint  in  3  above.  That  is,  we  wish  to 


N 

minimize  Z  d.p  subject  to  3 ,  where  Vp  is  the  immediate  predecessor 

of  Vp  i.e.,  the  node  closest  to  v.  on  the  path  between  v.  and  Vq  in 
T,  and  T  is  any  spanning  tree.  We  consider  exact  (optimal)  solutions 
to  this  problem.  The  primary  motivation  for  the  work  is  to  develop 
an  exact  algorithm  capable  of  permitting  study  of  the  performance  of 
heuristics  on  a  broader  class  of  problems  than  was  previously  studied, 
to  gain  insight  into  the  performance  of  both  exact  and  heuristic 
procedures  and,  in  particular,  to  pinpoint  where  and  why  they  fail. 
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II.  OUTLINE  OF  A  NEW  OPTIMAL  SOLUTION  TECHNIQUE 

There  currenty  exist  several  techniques  which  will  yield  optimal 
solutions  to  the  CMST  problem.  These  techniques  can  be  divided  into 
two  classes  -  branch  exchange  methods  (as  proposed  by  Lin  (22)  and 
Frank  (9))  and  branch  and  bound  methods  (10,22).  We  concentrate 
on  the  latter  class  of  techniques. 

The  specific  application  of  branch  and  bound  techniques  to  the 
solution  of  the  CMST  problem  was  proposed  by  Chandy  and  Russell 
(3)  and  was  subsequently  refined  (2)  so  that  it  could  treat  somewhat 
more  meaningful  problems.  Subsequently,  Elias  and  Ferguson  (4) 
proposed  further  refinements  and  thereby  expanded  the  range  of 
applicability  of  the  technique.  Gavish  (34)  recently  developed  a 
bound  using  Lagrangean  relaxation. 

The  basic  technique  is,  as  has  already  been  mentioned,  a  branch 
and  bound  algorithm.  The  original  problem  considered  has  all  branches 
in  the  category,  "permissible,"  i.e.,  any  branch  may  or  may  not  be 
part  of  the  final  solution.  Subproblems  are  generated  by  selecting  a 
permissible  branch  and  making  it  "prohibited"  in  one  subproblem  or 
"required"  in  another. 

The  relaxation  used  is  simply  to  generate  a  modified  MST  by 
including  all  "required"  branches,  excluding  all  "prohibited"  branches, 
and  forming  the  tree  of  minimum  total  length  by  connecting  (as  yet 
unconnected)  nodes  using  remaining  ("permissible")  branches. 

Clearly,  a  solution  obtained  in  this  manner  is  a  lower  bound  on 
the  value  of  a  feasible  solution  to  the  subproblem  as  it  is  the  tree  of 
minLmum  length.  Note  also  that  in  the  case  where  ail  arcs  are  speci¬ 
fied  (prohibited  or  required),  the  lower  bound  and  solution  are 
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identical  and  the  subproblem  fathoms.  In  general,  Lhe  subproblem 
fathoms  when 

1.  No  feasible  solution  exists  to  the  subproblem.  This  occurs, 
when  the  required  branches  form  a  loop,  when  the  required 
branches  create  a  subtree  violating  the  constraints,  or 
when  the  prohibited  branches  disconnect  the  network. 
Other  criteria  exist  but  are  difficult  to  test  for. 

2.  The  lower  bound  equals  or  exceeds  the  value  of  the  best 
solution  found  thus  far. 

3.  The  lower  bound  solution  is  feasible. 

When  all  subproblems  have  fathomed,  the  current  best  solution  is 
the  global  optimum. 

A  number  of  observations  have  been  made,  which  can  be  used  to 
accelerate  a  basic  branch  and  bound  technique.  One  of  these,  which 
is  used  in  the  sequel,  is  given  in  Theorem  1  below: 

Theorem  1:  (3) 

If  branches  (vQ,Vj  ),  (vQ,Vj  ) . 

12  IC 

some  MST,  T,  on  V  then  there  exists  a  CMST  including 
these  edges. 

Corollary : 

If  arcs  (vQ,Vj  ),  (Vq,v.  ) .  ^  present  in  the 

modified  MST  produced  in  any  subprobiem,  then  (if  any 
CMST's  exist  in  the  subproblem)  there  exists  a  CMST  on 
the  subproblem  containing  these  arcs. 

The  preceding  theorem  and  corollary  allow  one  to  avoid  consider¬ 
ing  subproblems  with  such  arcs  prohibited.  The  techniques  devel¬ 
oped  in  the  sequel  make  explicit  use  of  both  observations  as  well  as 
others  made  in  Lhe  references  cited. 


i 
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The  inherent  problem  with  the  existing  procedures  lies  in  the 
relaxation  method  used.  At  each  step,  the  problem  is  relaxed  to  a 
modified  MST.  Unfortunately,  this  bound  is  often  too  loose  to  elimi¬ 
nate  a  sufficiently  large  percentage  of  the  subproblems  to  make  the 
procedure  practical.  This  is  particularly  evident  when  Lhe  constraints 
are  tight;  it  is  for  such  problems  that  the  relaxation  is  loosest. 
UnforUinately ,  it  is  also  for  that  class  of  subproblems  that  the  known 
heuristics  display  the  widest  variation  in  the  quality  of  solutions . 

Note  that  any  optimal  solution  to  the  CMST  problem  has  the 
property  that  all  subtrees  are  MST's  on  the  set  of  nodes  contained  in 
the  subtree  and  the  center.  Thus,  it  suffices  to  find  the  optimal 
partition  of  the  nodes  into  subtrees.  The  technique  which  is  devel¬ 
oped  in  the  following  sections  will  thus  generate  partitions  of  the 
nodes . 

The  technique  works  within  the  framework  of  branch-  and  bound 
algorithms,  as  did  the  techniques  referred  to  above.  We  develop  two 
algorithms,  one  based  on  generating  subproblems  by  restricting 
nodes,  and  the  other  based  on  generating  subproblems  by  restricting 
arcs.  These  techniques  differ  from  previous  ones  in  that  the  relaxa¬ 
tion  used  here  is  tighter  and  thus,  a  smaller  number  of  subproblems 
need  be  examined. 

We  begin  by  restricting  the  problem  slightly.  We  seek  a  CMST 
subject  to  the  constraint  that  the  number  of  nodes  (rather  than  the 
sum  of  the  weights  of  the  nodes)  in  any  subtree  not  exceed  a  pre¬ 
specified  maximum.  Since  our  primary  intent  here  is  to  study  the 
performance  of  CMST  algorithms,  this  modification  would  not,  in 
general,  have  a  significant  effect.  Indeed,  if  one  preferred,  a  node 


of  weight  K  could  be  replaced  by  K  nodes  of  weight  1,  providing  one 
is  willing  to  allow  the  original  node  of  weight  K  to  be  split  among 
more  than  one  subtree. 

To  find  a  partition  of  the  nodes  V  =  {vj  i  =  l,2,...n}  into 
subtrees,  we  begin  by  making  n  copies  of  each  node  corresponding  to 
the  possibilities  of  the  node  being  in  any  of  n  possible  subtrees. 
Thus,  v.j  corresponds  to  node  v.  being  in  subtree  j. 

The  problem  of  obtaining  an  optimal  partition  of  nodes  into 
subtrees  can  be  thought  of  as  one  of  selecting  an  optimal  subset  from 
the  set  E  =  (v^l  i=l,2,...n;  j=l,2,...n}.  Feasible  subsets  of  E, 
i .  e . ,  those  corresponding  to  partitions  satisfying  the  capacity  con- 
straiint,  will  contain  1  v..  for  each  i  and  at  most  m  v^.'s  for  each  j. 
If  we  associate  a  weight,  w.^,  with  each  the  optimal  subset  of  E 
(hence  the  optimal  partition  of  V)  is  defined  as  the  feasible  subset  of 
minimum  total  weight. 

.^n  efficient  algoriLhm  (see  14,  21,  35)  exists  for  the  solution  of 
the  proble.m  of  finding  the  optimal  subset  of  E  given  the  values  of 
w.. .  The  algorithm,  which  can  be  thought  of  as  a  matroid  interesec- 
tion  [1,17,19,20,29]  algorithm  or  alternatively  as  a  series  of  shortest 
path  problems  in  appropriately  defined  graphs,  has  a  worst  case 
running  time  of  order  n^  and  in  practice  has  a  running  time  closer  to 
order  n^  (see  35).  Unfortunately,  the  set  of  weights,  w.^. ,  which 
correspond  directly  to  the  ’’cost"  (contribution  to  the  overall  length 
of  the  CMST)  of  v.  in  subtree  j  can  be  specified  only  when  the  pro¬ 
blem  solution  is  already  known.  We  can,  however,  define  a  set  of 
weights,  w..,  which  have  the  property  that  the  optimal  partition  found 
using  these  weights  will  ha^a  a  value  (sum  of  weights)  which  is  a 
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lower  bound  on  the  length  of  the  optimal  CMST.  Thus,  we  can  relax 
the  CMST  problem  to  the  problem  of  finding  an  optimal  partition. 
This,  together  with  generating  subproblems  by  successively  restrict¬ 
ing  either  nodes  or  arcs,  gives  rise  to  an  optimum  CMST  algorithm 
within  the  branch  and  bound  context. 

An  appropriate  set  of  weights,  w.^,  can  be  defined  as  follows. 
Suppose  we  are  given,  for  each  subtree  •,  the  set  Vj  of  nodes  per¬ 
mitted  in  the  subtree;  a  procedure  for  obtaining  the  will  be  given 
below.  One  can  then  find  Tj,  the  minimum  spanning  tree  on  the 

nodes  in  V.  {v  }.  Let  d..  be  the  length  of  the  arc  connecting  v. 

]  0  1]  ^  1 

to  its  predecessor  in  Tj  (i.e.,  d.j  is  the  length  of  the  last  arc  in  the 
path  from  Vq  to  v.  in  T^).  The  following  theorem,  which  is  proven  in 
(35),  allows  us  to  obtain  appropriate  w..; 

Theorem  2:  The  weight  of  the  optimal  partition  using  w.j  =  d^j  is  a 
lower  bound  on  the  length  of  the  CMST  for  the  same  V  and  .M 

Furthermore,  it  is  proven  in  (35)  that  other  similarly  .defined 
w.j's  also  preserve  this  lower  bound.  In  particular,  suppose  Tj 
contains  a  path  (v^,...Vp,  v^, . . .  v^, . . .  v^, . . . )  as  shown  in  Figure 
1.  Let  S  be  the  set  of  nodes  •  •  •'^1^' •  •  •'^3> 
largest  weight  of  any  node  in  S .  Suppose  w^ ,  >  w,^. .  Define  A  = 
Wp.  -  Wj^j.  Then  the  following  theorem  holds: 

Theorem  3:  If  a  set  of  weights  w.^  =  d^^  is  modified  by  transferring 
weight  A  from  w^.  to  w^,  where  A,  v^  and  v^  are  defined  as  above, 
the  weight  of  the  optimal  partition  is  still  a  lower  bound  on  the  length 
of  the  CMST  for  the  same  V  and  m. 

Theorem  3  allows  us  to  transfer  weight  from  a  node  to  its  suc¬ 
cessors  in  Tj  in  order  to  guarantee  that  the  lower  bound  obtained 
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from  the  partitioning  problem  is  at  least  as  tight  as  the  bound  ob¬ 
tained  using  an  MST,  as  is  done  in  (2),  (3),  and  (4).  A  proof  that 
this  can  always  be  done  is  given  in  (35).  As  an  example  of  how  this 
works,  consider  the  network  shown  in  Figure  2a.  The  MST  for  this 
network  and  the  node  weights  corresponding  to  it  are  shown  in  Figure 
2b.  These  weights  correspond  to  the  bound  obtained  using  an  MST. 
Suppose,  however,  that  we  restrict  V3  from  being  part  of  a  given 
subtree  j.  The  weights  shown  in  Figure  2c  would  then  be  obtained  if 
we  simply  set  w..  =  d...  Note  in  particular  that  w-.  has  been  reduced 

i]  1] 

from  5  to  1.  This  reduction  in  w^j  could  result  in  a  'oosening  of  the 
lower  bound.  Theorem  3  allows  us  to  transfer  up  to  A  =  w^.  -  w^j, 
i.e.,  7  units  of  weight,  from  w^^^  to  W2j  and  obtain  the  weights  shown 
in  Figure  2d.  .Note  that  the  w.^  in  Figure  2d  are  at  least  as  great  as 
the  w..  in  Figure  2b.  Thus,  the  lower  bound  obtained  using  Lhe  w^. 
in  Figure  2d  will  be  at  least  at  tight  as  that  obtained  using  an  MST. 
In  fact,  the  bound  so  obtained  is  significantly  tighter,  as  is  shown 
by  the  computational  experience  given  in  Section  V. 

We  now  turn  to  the  question  of  the  branching  rule  within  the 
branch  and  bound  procedure.  Little  was  said  by  Chandy  and  Russell, 
Chandy  and  Lo,  and  Elias  and  Ferguson  on  the  order  in  which  sub¬ 
problems  are  considered  in  the  branch  and  bound  procedure.  Classi- 
cially,  two  approaches  are  available.  The  first  is  to  always  consider 
the  subproblem  with  the  least  lower  bound.  Alternatively,  one  can 
use  depth  first  search,  where  one  always  solves  most  recently  gener¬ 
ated  subproblems  before  returning  to  older  subproblems.  There  are 
advantages  and  disadvantages  to  both  approaches. 
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The  first  approach  allows  one  to  proceed  without  any  good 
feasible  solutions  to  guide  the  process.  The  assumption  is  that 
subproblems  with  the  lowest  lower  bounds  will  give  rise  to  the  best 
feasible  solutions.  Hence,  one  prefers  to  explore  these  subproblems 
first  in  the  hope  that  they  will  give  rise  to  low  cost  feasible  solutions 
which  will  eliminate  other  subproblems  (with  higher  lower  bounds) 
from  consideration.  .Also,  by  examining  subproblems  in  this  order, 
one  is  continually  narrowing  the  range  between  the  upper  and  lower 
bounds,  and  hence,  has  the  option  of  terminating  the  algorithm  when 
the  interval  shrinks  to  some  prespecified  width. 

There  are,  however,  two  major  drawbacks  to  this  approach. 
First,  one  must  keep  (a  potentially  large  number  of)  subproblems 

around  in  order  to  select  the  next  one.  Thus,  the  storage  required 

« 

for  the  procedure  is  potentially  exponential.  In  practice,  it  was 
storage,  not  running  time,  which  was  the  active  constraint  on  proble.m 
size  in  previously  developed  techniques.  One  could  temporarily  store 
subproblems  in  secondary  storage,  but  this  would  complicate  and  slow 
down  the  procedure. 

Second,  by  considering  problems  in  ascending  order  of  lower 
bound,  one  will,  in  general,  be  sequentially  considering  dissimilar 
subproblems.  Thus,  one  cannot  easily  take  advantage  of  information 
obtained  in  the  solution  of  one  problem  for  the  solution  of  another. 
For  example,  in  the  Elias  and  Ferguson  technique,  the  similarity 
between  modified  MST's  for  related  subproblems  cannot  be  easily 
exploited  if  this  first  procedural  outline  is  adopted. 

Using  depth  first  search  overcomes  both  of  these  objections. 
Indeed,  a  great  deal  of  simplification  is  obtainable  both  in  the  genera- 
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tion  of  subproblems  and  in  obtaining  solutions  owing  to  the  similarity 
of  successively  considered  subproblems. 

The  maximum  number  of  subproblems  which  need  be  kept  around 
at  any  time  is  bounded  by  the  number  of  nested  specifications  it  is 
possible  to  make.  Thus,  if  one  is  restricting  nodes,  the  bound  is  n; 
if  one  is  including  or  excluding  arcs,  the  bound  is  C^)-  This  essen¬ 
tially  eliminates  storage  as  an  active  constraint  on  the  size  of  the 
problem  which  can  be  considered. 

A  fxurther  reason  for  using  depth  first  search  is  that  the  major 
reasons  one  would  ordinarily  choose  the  first  procedure  are  not 
present  here.  Any  of  the  existing  heuristics  can  be  used  to  quickly 
generate  a  good  upper  bound.  Furthermore,  the  procedures  devel¬ 
oped  in  the  sequel  lend  themselves  to  generating  feasible  solutions  for 
all  subproblems.  Thus,  a  good  upper  bound  is  always  available. 

Hence,  depth  first  search  is  used  in  developing  the  techniques 
in  the  sequel.  It  should  be  further  noted  that  the  philosophy  used 
in  developing  these  techniques  was  to  create  the  simplest,  most  fle.x- 
ible  framework  within  which  to  work  so  that  a  variety  of  acceleration 
techniques  could  be  developed  and  tested.  The  concentration  is  on 
restricting  the  number  of  subproblems  examined  (which  is  exponential 
in  n)  rather  than  the  amount  of  work  spent  on  each  subproblem 
(which  is  a  low  order  polynomial  in  n). 


III.  NODE  PARTITIONING 


The  first  exact  technique  built  around  the  above  relaxation  is 
one  which  generates  subproblems  by  restricting  the  subtrees  a  node 
is  allowed  into.  The  procedure  is  described  below.  We  begin  by 
describing  the  initiali2ation  procedure. 

Step  0:  (Initialize) 

* 

0.1)  Find  an  upper  bound,  z  ,  using  a  heuristic  to  generate 
a  good,  feasible  solution. 

0.2)  Find  an  MST,  T,  and  identify  arcs  (Vq»v.  ),  (Vq,v.  ), 


0.3) 

0.4) 

0.5) 

0.6) 

0.7) 

0.8) 


Reorder  the  nodes  so  that  v.  ,v . v.  are  now  v. , 

T  ^  2  ^  Ic  ^ 

'^2 . '^k- 


For  1  1  n,  set  R.^  = 


TRUE  i  ^  j 
FALSE  i  =  j 


[R..  is  a  logical  variable  which  is  set  to  TRUE  if  v..  has 
been  removed  from  consideration  in  this  subproblem. 


Observations  made  above  allow  us  to  remove  some  v.. 
immendiately  ] . 


For  k  <  i  <  n,  set  Rjj  = 


^TRUE  i  >  j 
^  FALSE  i  <  j 


For  i  1  j  <  ft,  find  an  MST,  Tj,  on  V.  where 

Vj  =  {Vj|R|j  =  FALSE}  i.e.,  V.  is  the  set  of  nodes 
permitted  in  subtree  j. 


Set  w.j  =  d.j,  where  d-  is  the  distance  from  v.  to  its 
predecessor  in  T^. 

For  j  =  1,2,...,  k,  exchange  weight  between  w..  for 


different  values  of  i  so  that  the  resulting  modified 
weights,  w.'^  ,  satisfy: 

'^ij  ^"'iO 

where  w.q  =  d.p  in  the  unconstrained  MST,  T,  gener¬ 
ated  in  Step  0.2  above. 

0.9)  For  j  =  k+l,...,  n,  exchange  weight  between  w.^  so 
that  the  resulting  w/j  satisfy 

"I’i  i  "I'i  -  ^ 

The  justification  for  all  of  these  steps  was  given  in  Section  2. 

Steps  0.8  and  0.9  guarantee  that  the  individual  w/.  will  all  be  at 
least  as  great  as  the  weights  assigned  using  unconstrained  MST. 
Hence,  this  is  a  realization  of  the  statement  that  the  lower  bound 
obtained  using  this  procedure  must  be  at  least  as  great  as  the  lower 
bound  obtained  using  an  MST.  This  also  holds  true  for  subproblems. 

Thus,  we  have  initialized  a  subproblem  with  nodes  1,2 _ _  k  forced 

into  subtrees  1,2,...,  k,  respectively,  since,  for  i  <  k,  Rj.  =  TRUE 
for  i  j.  In  the  course  of  the  depth  first  search,  we  keep  track  of 
the  following  variables: 

d  =  The  depth  of  the  search,  i.e.,  the  number  of  nodes 
which  have  been  forced,  d  is  initialized  to  k. 

^MIN  ~  minimum  allowable  depth,  is  initialized  to  k 

since  at  least  k  nodes  should  will  be  forced. 

IW^  =  The  subtree  which  the  d  node  is  forced  into. 

The  depth  first  search  proceeds  by  forcing  node  k+1  into  sub¬ 
tree  1;  i.e.,  d  is  set  to  k+1  and  is  set  to  1.  It  continues 

either  by  increasing  d  (to  force  another  node)  changing  IW^  (to  force 
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a  node  into  a  different  subtree)  or  decreasing  d  (to  release  a  node 
after  forcing  it  successively  through  all  subtrees).  The  depth  first 
search  procedure  follows. 


Depth  First  Search  Procedure 


Step  0:  Initialize  problem  (Steps  0.1  through  0.9  above). 


Set  d  =  k+1 


Set  =  k+1  Set  IW^  =  1 

Step  1:  Solve  the  currently  defined  subproblem;  i.e.,  find  a 
lower  bound  z^,  and  a  feasible  solution  Zp. 

Step  2:  If  the  current  subproblem  fathoms;  i.e.,  z^^  >  Zp,  go 
to  Step  3;  otherwise  go  to  Step  5. 

Step  3:  Set  IW^  =  IW^  +  1 

If  IW^  >  NMAX^  go  to  Step  4;  otherwise  set  up  a  new 
subproblem  and  go  to  Step  1.  NMAX^  is  the  highest 
indexed  subtree  which  the  node  at  depth  d  may  be 
forced  into.  In  section  3  we  observed  that  one  should 


not  skip  over  subtrees.  Thus: 

NMAX^  =  max  (K,  max  (IW.J)  K  <  i  <  d 
Step  4:  Set  d  =  d+1 

If  d  <  stop;  otherwise  set  up  a  new  subproblem 

and  go  to  Step  1. 

Step  5:  Set  d  =  d+1 
Set  IWj  =  1 

Set  up  new  subproblem  and  go  to  Step  1. 

To  set  up  a  new  subproblem,  one  need  only  modify  the  values  of 
a  few  Ry  to  impose  (or  remove)  the  restriction  implied  by  the  alter¬ 
nation  of  d  and  IW^.  Thus,  after  d  is  set  to  d+1  and  IW^  is  set  to  1; 
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r  FALSE  j  =  1 

‘dr  ( 


TRUE  j  >  1 


After  IW^  is  set  to  IW^  +  1: 
^d,IW^  “  FALSE 

Rd,IW^-l  =  TRUE 


After  d  is  set  to  d  -  1; 

^  FALSE  j  <  NMAX^ 
L  TRUE  j  >  NMAX^ 


R 


d+l.j 


The  search  space  can  be  further  pared  using  the  corollary  to 
Theorem  1.  If,  in  any  subproblem,  one  finds  that  two  nodes,  v.  and 
Vj,  both  forced  into  the  same  subtree  appear  in  separate  subtrees  in 
the  MST  formed  on  the:  set  of  permissable;  nodes  in  that  subtree ,  then 
the  subproblem  may  be  discarded. 

As  was  mentioned,  this  optimal  technique  based  on  generating  a 
partition  lends  itself  simply  to  obtaining  a  feasible  solution  to  each 
subproblem.  The  partition  generated  at  each  step  is  feasible.  One 
need  only  generate  MST's  on  each  group  of  nodes  to  obtain  a  feasible 
solution.  A  simple  acceleration  technique,  which  proved  to  be  quite 
effective  in  practice,  was  to  reorder  the  nodes  v.  .  , . . . ,  v  by 
distance  from  Vq,  nearest  first.  This  tended  to  increase  the  lower 
bound  most  rapidly.  Such  nodes,  when  restricted  to  a  single  sub¬ 
tree,  were  absent  from  all  others,  and  the  "deprived"  subtrees  were 
often  forced  to  connect  to  v^  over  longer  arcs. 

The  Rjj  are  used  in  the  optimal  partitioning  procedure  in  a 
straight  forward  fashion;  any  element  v.^,  with  its  corresponding  R.. 
=  TRUE,  is  considered  to  be  removed  from  the  problem. 
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The  weight  exchange  procedure  described  as  part  of  the  initiali¬ 
zation,  which  guarantees  that  the  weight  on  each  node  will  be  at  least 
as  great  as  its  contribution  to  the  length  of  an  MST,  is  used  here  as 
well.  At  each  level,  K,  in  the  decision  tree,  we  save  the  values  of 

ir 

the  w.'j  in  a  variable  referred  to  as  w^.  We  then  demand  that  w/j  = 
w^j  >  i.e.,  we  exchange  weights  to  enforce  the  restriction.  The 

justification  for  doing  so  is  identical  to  that  used  in  the  initialization 
procedure.  Note  that  this  exchange  guarantees  not  only  that  the 
lower  bound  will  remain  tighter  than  an  MST,  but  also  that  the  lower 
bound  will  be  monotone  with  the  depth  in  the  decision  tree.  Neither 
of  these  things  is  true  without  the  exchange. 


IV.  ARC  RESTRICTIONS 


AnotHc'  method  of  applying  this  relaxation  technique  to  the 
solution  to  the  CMST  problem  is  to  restrict  arcs;  i.e.,  to  force  arcs 
to  be  either  "prohibited"  or  "required"  as  was  done  by  Chandy  and 
Russell  and  Elias  Ferguson.  Thus,  the  initialization  and  subproblem 
solution  are  essentially  the  same  as  they  were  in  the  technique  des¬ 
cribed  in  the  previous  chapter,  but  the  method  of  generating  subpro¬ 
blems  is  different.  The  solution  order  is  still  a  depth  first  search. 

Some  differences  exist  in  the  initialization  procedure.  Instead  of 
forcing  a  node  into  a  subtree,  we  simply  "require"  the  arcs  (i^,0), 
...,  (ij^.O)  which  are  part  of  the  unconstrained  MST.  This  is,  of 
course,  equivalent  to  what  was  done  in  the  previous  case.  It  is 
implemented  in  a  slightly  different  way,  however. 

The  entire  procedure,  both  during  initialization  and  during 
subsequent  subproblem  generation,  restricts  itself  to  dealing  with 
established  arcs,  i.e.,  arcs  which  connect  a  node  directly  to  v^  or  to 
other  nodes  connected  to  Vq  by  established  arcs.  Thus,  each 
"required"  arc  forces  a  node  into  a  given  subtree  and  each 
"prohibited"  arc  forces  a  node  out  of  a  given  subtree.  As  a  new 
subtree  is  encountered  U-e----,  when  an  arc  of  the  form  (Vq,  v.)  is 
made  "required"),  we  simply  assign  the  next  available  subtree  number 
to  the  subtree. 

Subproblems  are  generated  by  successively  restricting  (requiring 
or  prohibiting)  established  arcs.  We  again  use  d  to  represent  the 
depth  of  the  search.  Here,  however,  d  refers  to  the  number  of 
forced  arcs  rather  than  the  number  of  forced  nodes.  Note  that  while 
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the  number  of  required  arcs  is  limited  to  n,  the  number  of  prohibited 
arcs  is  not.  We  thus  have  a  different  type  of  decision  tree  than  we 
did  in  the  previous  section. 

When  forcing  nodes  into  subtrees,  we  dealt  with  a  tree  of  depth 
n  but  with  nodes  of  degree  sometimes  as  great  as  n.  Here,  we  deal 
with  a  binary  tree  of  depth  as  great  as  C^)-  It  is  not  clear,  how¬ 
ever,  especially  with  the  paring  techniques  being  used,  which  deci¬ 
sion  tree  is  actually  larger. 

The  arc  chosen  for  inclusion  is, in  each  case  the  next  arc  to  be 
brought  in  by  Prim's  MST  Algorithm  (25);  i.e.,  the  shortest  arc 
connecting  a  node  to  some  node  connected  to  Vq  by  required  arcs. 
This  has  several  advantages: 

1.  The  arc  chosen,  if  excluded,  will  tend  to  raise  the  lower 
bound.  This  is  important  as  it  helps  control  the  size  of  the 
decision  tree.  Since  a  potentially  large  number  of  succes¬ 
sive  arc  exclusions  is  possible,  it  is  important  that  an  arc 
exclusion  result  in  an  increase  of  the  lower  bound  as  often 
as  possible  so  that  the  fathoming  process  will  limit  the 
depth  of  the  search. 

2.  If  the  arc  is  of  the  form  (Vq,  v.),  it  need  only  be  consid¬ 
ered  as  "required"  and  not  "prohibited."  This  is  a  direct 
consequence  of  Theorem  1. 

3.  If  the  arc  (v.,  v.)  is  prohibited,  and  hence,  v.  is  excluded 
from  the  subtree  containing  v^,  then  so  are  all  arcs  of  the 
form  (V.,  Vj^),  where  v^^  is  forced  into  the  same  subtree  as 
Vj.  This  is  a  direct  consequence  of  an  Elias  and  Ferguson 
result. 
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We  omit  the  details  of  the  remainder  of  the  implementation  of  the 
arc  restricting  procedure  as  they  are  similar  to  the  node  restricting 
procedure  described  above. 
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V.  COMPUTATIONAL  EXPERIENCE 

The  procedures  described  in  the  previous  two  sections  were 
coded  in  FORTRAN  and  run  a  PDP-10.  In  this  secton,  we  discuss 
the  results  of  experiments  run  to  test  the  behavior  of  their  run  time 
and  effectiveness  as  a  function  of  problem  size  and  constraint  tight¬ 
ness. 

Problems  were  generated  by  reading  in  n  and  m  and  generating 
random  X  and  Y  coordinates  for  the  nodes  within  a  unit  square.  The 
location  of  the  center  was,  in  various  problems,  either  random, 
centered,  or  in  the  corner.  Euclidean  distances  were  used.  Most 
experiments  were  run  with  the  center  at  the  geographic  center  of  the 
unit  square;  in  this  way,  larger  problems  could  be  examined. 

Several  series  of  problems  were  run  with  identical  values  of  n 
and  m  (and,  of  course,  different  randomly  generated  points)  to  see 
how  stable  the  running  time  is  from  one  problem  to  another.  The 
standard  deviation  was  found  to  be  close  to  the  mean  for  the  problems 
run.  This  essentially  sa'  _hat  we  should  not  pay  close  attention  to 
exact  run  times  or  the  exact  number  of  subproblems  examined. 

Series  of  problems  were  run  varying  n  and  m  and  using  both  the 
node  restricting  and  arc  restricting  procedures.  Both  procedures 
were  run  with  the  identical  problems  and,  furthermore,  the  same  set 
of  nodes  was  used  (v/ith  new  nodes  added  as  the  problem  size  grew) 
for  all  problems  in  this  series.  This  results  of  this  experiment  are 
shown  in  Table  1.  .‘\s  can  be  seen,  the.  running  times  for  both 
procedures  were  comparable  and  run  time  grows  exponentially  with 
problem  size.  (It  was  gratifying  to  find  that  the  optimal  solution 
values  found  by  both  procedures  always  matched!) 
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It  has  already  been  mentioned  that  these  procedures  yield  lower 
bounds  which  are  at  least  as  great  as  those  obtainable  using  uncon¬ 
strained  MST's.  This  was  verified  empirically  by  actually  generating 
lower  bounds  with  MST’s  as  the  algorithm  proceeded.  The  program 
was  to  print  any  exceptions,  i.e.,  any  times  where  the  .MST  gener¬ 
ated  a  higher  lower  bound;  none  occurred.  Figure  3  shows  some 
typical  lower  bound  values  obtained  using  partitioning  and  MST's.  .\s 
can  be  seen,  not  only  are  the  partitioning  lower  bounds  greater,  but 
they  grow  more  quickly  with  depth.  This  is  significant,  as  a  linear 
increase  in  lower  bound  value  will  reduce  the  run  time  exponentially. 

To  measure  the  impact  of  the  difference  in  lower  bounds  between 
the  .MST  and  partitioning  methods,  several  problems  were  run  first 
with  Lhe  partitioning  meLhod  and  then  with  the  MST  method  of  lower 
bounding.  The  results  of  this  experiment  are  shown  in  Table  2.  .\s 

can  be  seen,  the  partitioning  algorithm  examines  a  much  smaller 
number  of  subproblems,  and  apparently,  its  effectiveness  increases  as 
the  problems  grow  larger.  Thus,  although  it  is  somewhat  more  diffi¬ 
cult  to  evaluate  the  lower  bound  using  the  partitioning  algorithm  than 
it  IS  using  an  .MST,  it  is  le.-s  than  n  times  as  hard  to  do  so.  The 
reduction  in  the  number  of  subproblems  which  .must  be  e.xamined 
appears  to  be  sufficiently  great  to  warrant  the  use  of  the  partiuoning 
technique.  Indeed,  as  the  problem  size  grows,  its  attractiveness 
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.''AD-A131  674  RESEARCH  IN  NET  MANAGEMENT  TECHNIQUES  FOR  TACTICAL  DATA 
NETtfORKS(U)  POLYTECHNIC  INST  OF  NEW  YORK  BROOKLYN 
R  BQORSTYN  ET  AL .  SEP  02  CECOM-80-0579-3 
UNCLASSIFIED  DAAK80-80-K-057g  F/G  t7/2 


VI.  SUMMARY  AND  CONCLUSIONS 

The  purpose  of  this  study  was  to  develop  improved  exact  tech¬ 
niques  for  the  solution  of  the  CMST  problem  (a  model  of  the  multidrop 
line  problem)  so  that  known  heuristics  for  the  solution  to  that  prob¬ 
lem  could  be  examined  on  a  broader  class  of  problems  and  so  that  new 
heuristics  could  be  developed  on  the  basis  of  what  was  learned. 
Much  of  this  happened.  An  improved  exact  technique,  based  upon 
generating  lower  bounds  using  partitioning  instead  of  MST's,  was 
developed  and  computational  experiments  were  run  using  it.  The 
bounds  yielded  by  these  techniques  were  tighter  than  those  yields  by 
the  MST  based  techniques,  and  hence,  the  number  of  subproblems 
which  had  to  be  examined  in  order  to  obtain  a  solution  was  smaller. 
Indeed,  the  decrease  in  the  number  of  subproblems  examined  more 
than  compensated  for  the  increased  effort  required  for  the  examina¬ 
tion  of  each  subproblem.  Thus,  the  new  techniques  served  their 
purpose  in  that  they  permitted  the  examination  of  problems  not  care¬ 
fully  examined  before.  In  particular,  it  was  possible  to  examine 
problems  with  very  tight  constraints,  although  it  was  not  possible  to 
examine  problems  of  substantially  greater  size  than  had  been  pre¬ 
viously  examined. 

Even  with  the  improved  technique,  the  growth  of  run  time  with 
respect  to  problem  size  was  found  to  be  exponential,  albeit  of  a  lower 
order  than  previously  know  exact  techniques  and  of  a  much  lower 
order  than  the  solution  space.  Thus,  one  cannot  use  the  technique 
for  large  problems.  A  number  of  acceleration  techniques  were  devel¬ 
oped  and  incorporated  into  the  procedure. 
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Thus,  it  was  possible  to  examine  sufficiently  interesting  problems 
using  the  exact  technique  to  make  several  insights  into  the  problem. 
The  first  is  that  the  performance  of  the  known  heuristic  degrades  as 
the  constraint  tightness  increases  and  improves  as  the  problem  size 
increases.  An  imporved  heuristic  [33]  was  also  developed  on  the 
basis  of  this  study. 
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INTERSECTION 


security  classification  of  this  page  /•WiM  Dmim  Enitnd) 


REPORT  DOCUMENTATION  PAGE 


1.  REPORT  number 

CECOM-80-0579-3 


4.  TITLE  (mtd  Subtttt*) 

Research  In  Net  Management  Techniques  for 
Tactical  Data  Networks 


7  AuTMonri; 

Dr.  Robert  Boors tyn 
Dr .  Aaron  Kershenbaum 


9.  performing  opganization  name  ano  address 
Polytechnic  Institute  of  New  York 
333  Jay  Street 
Brooklyn,  New  York  11201 


II.  controlling  office  name  and  address 
CDR,  CECOM 
DRSEL-COM-RF-2 

Ft.  Monmouth,  New  Jersey  07703 


14.  monitoring  agency  NA.UE  S  AOORESSf/f  dlllatant  Irom  Controlling  Ollico) 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


3.  RECIPIENT’S  CATALOG  NUMBER 


s.  type  of  report  a  period  covered 


3rd  Semiannual 


6.  PERFORMING  OTG.  REPORT  NUMBER 


8.  CONTRACT  OR  GRANT  NUMBERCa; 


DAAK  80-80-K-0579 


10.  PROGRAM  element.  PROJECT,  TASK 
AREA  a  WORK  UNIT  NUMBERS 


JL^.61102.AH48.DF01 


12.  REPORT  DATE 

September  1982 


13.  NUMBER  OF  PAGES 

83 


IS.  SECURITY  CLASS,  (ol  Ih  > 

UNCLASSIFIED 


ISw.  OECL  ASSIFICATION.'COWNCPAOINC 
SCHEDULE 


IS.  distribution  STATEMENT  'al  this  Rapott) 

Approved  for  Public  Release; 
Distribution  Unlimited. 


17,  distribution  statement  (oi  th0  *b9trmtt  in  Block  20,  it  dliUrent  from  Roport) 


19.  KEY  WORDS  (Continum  on  rororam  aido  it  nocoswmry  and  by  block  nutrbar) 

Packet  Radio,  Packet  Networks,  Packet  Switching,  Network  Management, 
Communications  Protocols 


20.r  ABSTRACT  fContinua  on  rararaa  alda  II  nacaaamry  and  Idantitr  by  block  numbarj 


-^is 


This  report  Is  the  third  semi-annual  report  covering  research  In  Packet  Radio 
communication  networks  design  and  analysis.  Presented  herein  is  an  approach 
for  the  throughput  analysis  of  arbitrary  multi-hop  packet  radio  networks  a 
teleprocessing  network  design  technique,  and  a  mathematical  technique  for 
combinational  organization  problgoa. 


Lgpia^ 


DO  I  JAN*71  1473  COITION  OF  I  NOV  88  I 


SECURITY  CLASSIFICATION  OF  THIS  PACE  Tirhwi  Data  Bniarad) 


k 


Cl :  ' 
DRSEL-CGM-RF  * 
DRSEL-CQM-RF-2  (10  Copies 3  ' 
DRS2L-CCM-D  i 


